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PREFACE TO THE REVISED EDITION 


While this book is presented as a revision of the book under 
the same name published by the senior author in 1943, it is actually 
a completely new piece of writing. Only an occasional paragrap 
and a few graphs and tables from the earlier book have been in¬ 
corporated in this one. Nevertheless, it resembles the earlier book 


in many respects. , , . . - 

Like its predecessor, this book is intended as a text for a one- 

semester introductory course. The authors have attempte o pro¬ 
vide an introduction to statistical inference which is modern intui¬ 
tive, and nonmathematical, although the coverage is limited by the 
scope of the book. An effort has been made to present the materia 
with such simplicity and clarity as to make it almost self-instructive 
The treatment is nonmathematical, formulas being presented 
economical descriptions of computational procedures and, in some 
instances, as a means of clarifying concepts. The formulas aie not 
derived, but the underlying assumptions are stated when those 
assumptions limit the conditions under which a formula may 

Pr °The y range of activities involved in the entire statistical enter¬ 
prise has been indicated as far as seems possible in a one-semester 
text. The purposes of statistical studies, and the collection, su - 
marization, and presentation of data are discussed in the hrst tou 
chapters. In the earlier book, and in many of the other current 
texts, these mattem are more or less taken for granted on the 
assumption that students understand these principles without m- 

trU Most of the book is concerned with statistical frequency dis¬ 
tributions in one and two variables. These distributions are an¬ 
alyzed by direct examination of their contents and by the use oi 
such summary measures as percentiles, means, stand arc eua ions, 
and correlation coefficients. It should be noted that m every 
instance measures which describe the distribution of the group are 
presented, first. Later considerable attention is given to measures 
such as percentile rank, standard score, norma ce'ia e, an 
tion from regression value, which describe the placemen o 

viduals with reference to the group. . , • . rvn i 

Chapters 13 to 16 deal with such topics as point and i te a 

estimation, tests of hypotheses, and power function, m s . ..... 
where these can be dealt with by the use of the normal probability 
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distribution or Student’s distribution. The authors have not felt 
it wise to thin down the explanatory material in order to make 
room for the binomial, chi-square, and F distributions. However, 
by including the z transformation for r, the correction for continuity 
in tests of proportions, certain facilitating tables, and certain non- 
parametric tests, the range of applications has been considerably 
extended. The Appendix contains a number of useful tables, one 
of which has not been published previously. 

The exercises in this text are intended to serve two purposes. 
Some are the usual t3 r pes of practice exercise, presenting applica¬ 
tions of methods under discussion. Other exercises are intended to 
classify meanings, to encourage the student in an active search for 
relationships, and to extend the development of concepts already 
presented. Because the sets of exercises are timed to come at 
particular stages in the expository treatment, it seems desirable to 
place them immediately after the exposition of particular topics 
rather than at the end of chapters or in a separate manual. Answers 
to questions in the exercises appear in the Appendix. 

In preparing exercises the authors have made repeated use of 

thp same two sets of data, so that the same data may appear in 

exercises in tabulation; graphing; computation of percentiles, 

means, and standard deviations; and determination of relationships 

between pairs of variables. The decision to use the same sets of 

data in many types of problems is based on the belief that it is 

easier for a student to comprehend a statistical concept if he is 

familiar with the subject matter of the data. It should be pointed 

out, however, that a great variety of additional data are included 

in the book to enrich the reader’s experience. In the interest of 

emphasizing statistical principles, a new idea is often introduced 

with a miniature problem in which the arithmetic is so simple that 

computation requires little attention. These short problems have 

been specifically planned to facilitate the development of meanings. 

bhort problems are also used occasionally to test the student’s grasp 
of principles. ^ 


Ihe discussion of tabulation and graphing has been greatly 
expanded. In Chapter 6 the discussion of percentiles has been sepa¬ 
rated from that of percentile ranks to emphasize the difference in 
the role of the two concepts. The treatment of regression and 
ton elation has been greatly revised to clarify the role of these con- 

ovnn^ T w P lc ; atl0ns * f Perha P s the greatest change has been the 

thP p d r d V' ea ! mCn f t ° f statlst,cal inference, from one chapter in 
the earlier book to four chapters here. 


New York 
March, 1958 


H. M. W. 
J. L. 



How to Use This Book 


This section is addressed in part distinction need be 

the students in this case are adults, however, 

^Physical Features of the Book of this book 

familiarizes himself with its physical stru • £ Tables and Figures, 

leh help economize student time and energyare the Usts of^ ^ ^ 

which follow the Table of Contents in th ^ read Belective l y rather than 
Glossary of Symbols in the Appendix^ 1 encounUred . The page on which 

sequentially, unfamiliar terms or «y^ , ^ !red by consulting the Glossary of 

theae are introduced and explamed can ^ o[ numbcring tables, figures, and 

Sit ^jrSusly trained themselv. to 

material may find it necessary to ^ ‘he. P ^ ^ a page at , gulp they 
to understand that in a work of this natu s , ow because the number o 

may feel a sense of frustration and that £* their average. Students mus 

pages covered in a given period of time is matter . while 60me materia 

learn to gear themselves accordingto the I material ^ for though ful 
may appropriately be covered at high P . consideration with earlier 

The following pattern of study is a gg ‘ Rea(] a chapter rapidly to get 

understanding for a given expenditure ^ purpose of the discussion, 

a general impression of the mater '“ k a chapter or two ahead of the class 
In this rapid preliminary reading^ taken up in class, go over it in detail, corn- 

discussion. (2) Just before a se ver if y i n g numerical statements, an< ma mg 

paring, rereading, solving pro ) ’ ^ ^ 800I1 as possible after a sec ion 

note of all points that are not «j y con80 , i(la te the unde,standings 
has been studied in class, read i ^ ^ rer cad the chapter once 

gained from the class discussion. ( > studied prcvio usly not only clarifies 

more. The practice of rereading |(j p oints that seemed completely 

one’s thinking but tends to .mprov ^ onc experiences a real sense of 

obscure on a first reading now seem ^ scemed quite simple on a first 

= ,b. o. i... «• '-c““ 

Students unaccustomed to sei ma „y times. They should 

couraged by their need to r „f material. 

realise that this is a '"mtic » „ ( statistical method is to abstract from 

Abstract Imagination. 1 ne pun ^ 
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a mass of individual observations certain information which will enable one to 
comprehend the whole. Therefore, statistical measures are essentially abstrac¬ 
tions. An understanding of such abstract concepts has to be built up by experience, 
just as one builds up the concepts suggested by the words “quaint,” “patriotism,” 
“irresponsibility,” “nevertheless,” and “implication,” for which no photograph or 
diagram can suffice. These concepts must grow in the imagination. To stimulate 
such imagination is the purpose of certain sets of artificial data used throughout 
the text and of certain developmental exercises., both verbal and numerical. The 
building of concepts is a more important undertaking than any other aspect of the 
course, because without a clear comprehension of concepts all skills are useless. 

The beginner does not know which concepts can be clarified graphically and 
which cannot, and sometimes he is needlessly troubled by his inability to “see” 
a statistic (e.g., the variance) for which he can obtain no satisfactory visual image. 
If a student is uncertain about the existence of a useful visual symbol, he might 
well take the responsibility of asking his teacher before he indulges in too much 
worry. 

Mathematical Preparation. The extreme diversity of mathematical back¬ 
ground usually found in a beginning class creates problems for both the teacher 
and the student. This text is written primarily for the person whose study of 
mathematics ended when he left high school. At points where such persons are 
likely to experience difficulty, either special explanations are provided or references 
are made to Walker’s Mathematics Essential for Elementary Statistics (Holt, Rev. 
Ed., 1951). Teachers who use this book may want to require each student to take 
the self-scoring tests with which each chapter begins. Students who make no 
errors on a test need do nothing further with that chapter. Those who make errors 
should study the explanatory material and the practice exercises, then take a 
second self-scoring test parallel to the first one. Each student can go through the 
book at his own speed, reporting each week on the chapters he has studied and the 
number of errors made or t uch test. A student with good mathematical back¬ 
ground can finish the boot, in a few hours. The student who cannot needs the help 
he will secure from its use. By this method practically no class time is taken up 
with difficulties that relate to the simpler aspects of mathematics. 

Order of Topics. The sequence of development followed in this book is not 
the only one which might successfully have been adopted. However, if a person 
should attempt to study this book in a different sequence, rearranging the topics 
in some new order, he will almost certainly suffer the frustration of encountering 
the use of technical terms and concepts before he has studied the basic explanation 
clarifying them. The authors have devoted a great deal of thought to the arrange¬ 
ment of topics and have consistently used the earlier sections to prepare for later 
ones. Most of the accruing advantages will be lost if the sequence of topics is 
drusticullv revised. However, the ensuing confusion can be somewhat mitigated 
by consistent reference to the Index and Glossary of Symbols. 

Minimum Course. Not all classes will be able to complete this text in the time 
at their disposal. If some material must be eliminated, we suggest that Chapters 1, 
2, and 4 be rapidly skimmed, and that the following be eliminated altogether 
the last ten pages of Chapter 12; the last part of Chapter 14 beginning with 
“Power of a Test,” and all of Chapters 11, 15, and 1G. 
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The Rok o 


Statistical method is one of the devices by which men try to un¬ 
derstand the generality of life. Out of the welter of single: events 
human beings seek endlessly for general trends; out of the vast 
Id confusing variety of individual characters, they eontmually 
search for underlying group characters, for some picture of the 
group to which the individual belongs. This group picture 
not merely a summary of the individuals which form the group. 
It transcends the individuals. It has meaning of its own not to be 
discovered through the most intense contemplation of any single 
individual. Such group trends are sometimes apprehended su .- 
iectively and almost intuitively by a person of penetrating insight 
who observes, without the aid of numerical computations, that 
5is group is more variable than that” or ‘‘these wo character¬ 
istics are not likely to occur in the same individual, or the like. 
f ‘ , statements are fundamentally statistical in nature, because 
Uiey rehite^totendencies of a group as a whole which either have 
no meaning with reference to a single individual or may not be 
true of a particular selected individual. That such a statement s 
vague subjective, debatable, quite possibly false, and inefficient ) 
arrived at does not keep it from being statistical in nature Con- 
trolled objective methods by which group trends are abstracted 
from observations on many separate individuals are called sta- 

tl 8 t StotUti«°ta Modern Life. To a very striking extent our cul¬ 
ture has become a statistical culture. Even on the most elementary 
level it is impossible to understand psychology, sociology, econom¬ 
ics finance or the physical sciences without some general idea of 
2 meaning f an average, of variation, of relationship, of sam- 
pUng of how to read charts and tables. Even a person who has 
never heard of an index number is affected in the intimate details 
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of his daily living by the gyrations of those index numbers which 
describe the cost of living. Perhaps the clearest indication of the 
extent to which statistical ideas pervade modern life is provided 
by the use which advertising firms make of such ideas. Quoting 
statistics in favor of almost any product they wish to popularize, 
even if those statistics have little relation to the quality of the 
product, and making liberal use of graphs, they tacitly assume that 
the American people are accustomed to thinking in statistical 
terms or that if they do not understand they may at least be im¬ 
pressed by a statistical argument. 

Let us look briefly at some of the ways in which statistical 
facts and statistical ideas impinge on the life of an ordinary person. 
This business begins early. The parents of a newborn baby are 
soon asking how his rate of development compares with that of 
the “average” child, and are perhaps quite unaware of the large 
amount of statistical research required to furnish the norms needed 
to provide an answer for their queries. For the baby’s ills as well 
as their own, they expect the doctor to have available tested rem¬ 
edies. But such testing is the outcome of the statistical analysis 
of data obtained from large-scale experiments. They expect to 
be able to buy clothing and shoes accurately sized for children of 
various ages. But how could a manufacturer get the information 
which makes such sizing possible unless measurements had been 
made and statistically analyzed for large numbers of children at 
each age level? 

Perhaps the parents take out an insurance policy to provide 
for the eventual college education of the new baby. An insurance 
policy actually represents a gamble in which the individual bets 
that he will not live a specified number of years or that some stated 
disaster will happen to him, and the company bets that the disaster 
will not occur. Rates that will permit the company to stay in busi¬ 
ness but not to make excessive profits can be set only by a very 
large-scale continuing analysis of mortality data, accident data, 
and the like. It is impossible to foretell or control the chance 
factors in life, but it is possible statistically to estimate their group 
impact and by spreading costs over many people to mitigate the 
effect of calamity on any one person. 

In our “average” family, money is probably not plentiful and 
they watch the cost-of-living index anxiously, although they may 
have no idea of how the Bureau of Labor Statistics works year in 
and year out to gather the statistical data on which that index is 
based. Or they may watch apprehensively the published figures 
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on unemployment, without knowing anything about the enormous 
sampling project conducted by the Census Bureau and the Bureau 
of Labor Statistics in order to obtain current information on t 
extent of unemployment in various industries and various regions. 

When the erstwhile baby is of school age h.s parents expect 
the school to obtain and make proper use of information about 
aptitudes, abilities, progress. Yet without the psychologies and 
statistical labor of many people over many years, there would oe 
no tests of reading readiness, of intelligence, of achievement, 

V ° C Justus parents expect the medical profession to have reliable 
information about the effects of all kinds of drugs and treatments, 
old and new (information that can be secured only througn large- 
scale experiments, statistically analyzed), just so do they take 
for granted that the teaching profession will have tested informa¬ 
tion concerning how children learn; how they develop physnalh, 
mentally and emotionally; which school practices are beneficial 
to children’s development. Such knowledge requires the observa¬ 
tion of many children, with statistical analysis to extract meaning 

fr ° PotulaUonTncreases or population shifts will affect thegiving 

arrangements and personal comfort of our a\e g - . 

Statistical information concerning population changes is essential 
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authorities public utilities, and school boards, thus the a\e 
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the effectiveness of an advertising campaign, which are also based 
on samples. 

The teacher who does not understand certain basic statistical 
ideas is in a position to do real harm to pupils because of his 
ignorance. If he is not aware of the universality of human vari¬ 
ability and of ways of measuring it, he may use a test norm as a 
standard to which he tries to make all pupils conform. If he does 
not understand the approximate nature of all measurement, the 
conventional measures of test reliability, the qualities to be sought 
in selecting standardized tests, and methods of determining the 
reliability of teacher-made tests, he may place too much faith in 
unreliable measures of pupils. If he has never studied correlation 
and regression, he - may confuse correlation with causation and 
thus propose irrelevant action, or he may assume that the child 
with the highest intelligence quotient in his class should be ex¬ 
pected to stand highest in other desirable traits and therefore be 
censured if he does not excel, thus demanding the impossible of 
some pupils and giving insufficient challenge to others. 

The student majoring in education, psychology, sociology, or 
economics soon discovers that, unless he has at least a minimum 
acquaintance with statistical vocabulary and statistical method, 
much of the important literature of his field is incomprehensible 
to him. Often he discovers that he himself has become interested 
in working on some project to which statistical methods are basic. 

Everyone who reads the newspapers is now familiar with the 
various opinion polls and so has some vague idea about generaliza¬ 
tions based on samples. The educated person needs to understand 
something about the practice of drawing samples and making 
inferences from sample data, and to become sensitive to sources 
of bias in any sample on which he is depending for important in¬ 
formation; he needs to know that sampling has become a highly 
technical matter and that, if he plans to make an important in¬ 
vestigation utilizing sampling, he must either study the literature 
on sampling methods or consult an expert, preferably both. 

Persons who do not feel confident of their own ability to inter¬ 
pret a statistical statement usually resort to one of two extreme 
positions, each unsatisfactory to the well-educated man. One 
extreme is the uncritical acceptance of any statement buttressed 
by statistical data, however fantastic. The other extreme is un¬ 
critical suspicion of all statistical reasoning. Aside from these two 
extremes, the only alternatives appear to be either reliance on 
authority or some comprehension of what statistical reasoning is 
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most medical research. The immediate decision may be the choice 
between saying “yes” or saying, “The evidence is not convincing” 
to such a question as, “Do cigarette smokers have, in general, a 
shorter life-span than nonsmokers?” or “Does the Salk vaccine 
protect against polio?” But that decision may cause many in¬ 
dividuals and groups to make other related decisions as to a course 
of action. 

Action Suggested by Statistical Inquiry. A statistical investi¬ 
gation is not limited to providing a basis for choosing between 
alternatives which are known prior to the investigation. The pur¬ 
pose of the inquiry may be to suggest a new course of action. Some 
of the basic problems confronting our society, such as delinquency 
and alcoholism, do not yet have well-defined solutions even as 
possibilities. It is hoped that statistical studies underway on 
factors related to these problems will suggest remedial courses of 
action. 

Basic Statistics. A large part of statistical enterprise concerns 
itself with compilation of statistical data. — its summarization and 
presentation as a basis for a variety of decisions. In business, 
statistical data deal with such matters as items produced, labor 
used in production, sales, and profits or losses. In government, 
statistical data are concerned with such matters as size and compo¬ 
sition of population, and wealth of the nation and its welfare. 

The purposes to be served by the basic data are not always well 
defined at the time of compilation. Data obtained from a census 
may serve as a basis for revising congressional districts; they may 
help a chain store in establishing new branches; or they may 

point out the need for new facilities for higher education ten or 
fifteen years later. 

In the presentation of basic statistics, consideration needs to 
be given to the uses to which they will be put in making decisions 
leading to action. A great deal of data, such as population or 
wealth of many localities, is presented in the form in which they 
were obtained. The users may then relate and summarize these 
data in ways which are most helpful to them. 

Data for the nation as a whole, or for states, are often of suffi¬ 
cient interest so that governmental agencies present summaries of 
them in a variety of ways. As an illustration of such a summariza¬ 
tion, consider the consumers’ price index prepared by the United 
States Bureau of Labor Statistics. This is a weighted average of 
retail prices expressed as a percent. This index of prices is very 
important in interpreting wages and costs under the conditions of 
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varying prices. Other well-known summaries of this sort are 
indices of stock prices and of production. 

Summary of Purposes. Statistical investigations are justified 
by their usefulness in leading to improved courses of action. Some 
investigations are directly related to a specific purpose and are 
planned accordingly. Other investigations do not have clearly 
defined purposes, but it is expected that the information obtained 
in these investigations will serve a variety of purposes. 

Issues Preliminary to Any Statistical Inquiry. At this point 
the reader may be mistakenly assuming that by studying a text 
in statistics he will learn certain statistical computations and a 
kind of universal routine for attacking a statistical problem, and 
that he can then apply that routine and perform those operations 
in any situation he wishes to explore statistically. This is a serious 
misapprehension of the way a statistical study should be ap¬ 
proached. Before any statistical enterprise is begun, whether it 
be on a large scale or a small scale, whether its pattern be simple 
or complex, certain general issues must be faced. Decisions must 

be reached on the following points: 

(1) What concrete questions is the study designed to answer? 
The ability to ask good questions is one of the marks of an effec¬ 
tive, imaginative, creative research worker. Help at this point 
must come primarily from the content field rather than from 
methodological consideration. 

(2) To what class of individuals are the answers intended to 
apply? To those individuals examined only or to some larger 


group? 

(3) If the answers are to have some general import beyond the 
cases actually observed, how is the sample to be chosen? How 
large should it be? Are there any elements of bias in the proposed 
plan for selecting the sample of individuals to be examined? 

(4) What observations or measurements arc to be made on 
these individuals, and how? Each content field has its own special 
methodology for making observations and measurements, and this 
methodology must be learned by the specialist in that field. In 
this text only general matters related to a wide variety of fields 


can be mentioned. 

(5) How shall data be assembled for analysis? 

methods are described in Chapter 2. 

(0) What summary measures shall be computed? 

main theme of the book. 


(7) How shall results be presented? Some aspects 
lein are discussed in Chapters 3 and 4. 


Some useful 
This is the 

of this prob- 
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Gathering and Recording 
Data 


Once the purpose of a statistical investigation has been defined, 
the problem is to collect data which are relevant to that purpose, to 
analyze these data, and to present them in a meaningful manner. 
This chapter will deal somewhat sketchily with problems of 
gathering and recording data in order to facilitate their analysis 
and interpretation. 

Subjects of the Investigation. Statistical data may be con¬ 
sidered broadly as observations made on subjects. Subjects differ 
vastly from one inquiry to another. Sometimes subjects are 
individual persons, sometimes they are families or institutions 
or communities, sometimes they are physical objects such as build¬ 
ings or machines or samples of a manufactured product. The 
variety of things which may be made the subjects of inquiry is 
practically endless. 


Before the investigation begins, it is necessary to define the 
units about which investigations are to be made. For example, 
in planning a study of low rent public housing developments in a 
city, it would be necessary to decide whether observations are to 
be made on families, on persons, on dwelling units, or on some 
other type of individual. If the family is selected as unit, any of 
the following definitions might be selected, as well as many others: 
(a) a family moving into one of these developments during the 
period covered by the study; (b) a family living in one of these 
developments for any portion of the period covered by the study; 
(c) a family applying for entrance to the development during the 
period covered by the study. It would also be necessary to define 
“family.” If single persons are accepted, does a single person 
constitute a “family” for the purpose of the inquiry? Do two or 
three single persons living in the same apartment constitute one 
iamily. If so, does that family have a “head” to be enumerated 
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when the heads of families are classified? If the person who signs 
the lease has relatives other than wife and children living in the 
same apartment with him, is there one family or two? Using an 
apartment in one of the developments as unit would permit mak g 

observations on all the persons living in it during; a h ^ ‘ ot 
period thus securing information about turnover which could not 

be obtained from a study of family units Almost e " er y StU ^ 
presents similar perplexities of definition which must be resolved 
before the data are gathered. Each unit or subject is also called 
an individual, and sometimes an element. Obviously the term in 
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ardized test would be variables. a boys scbool, 
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private, as sectarian or nonsectarian , I * ‘ classified as 

enous or endogenous; candidates for an office 

winners or losers; and so on and on. 
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Variables which can take three or more values can be classified 
with respect to whether their possible values have a necessary order. 
Thus automobiles might be classified according to such ordered 
variables as price, number of miles they have been driven, condi¬ 
tion of the tires, and year of manufacture; or according to such 
unordered variables as make, state in which they are currently 
registered, and company with which they are insured. College 
students might be grouped in ordered classes on such variables as 
age, number of siblings, scores on a prognostic test, tuition paid, 
and popularity; or in unordered classes on such variables as place 
of birth, type of college attended, and department in which enrolled. 

Ordered variables may be classified as to whether their values 
have been placed on a numerical scale or .have only been described 
in a series of ordered categories (or classes). Of the variables men¬ 
tioned in the previous paragraph, condition of the tires of an auto¬ 
mobile might be described in such ordered but unsealed categories 
as: (a) beyond all usefulness, (b) badly worn, (c) slightly worn, 
(d) very good, (e) entirely new. Popularity of college students 
might be described in such ordered but unsealed classes as: 
(a) most popular, (b) popular, (c) not well liked, (d) greatly dis¬ 
liked. The other ordered variables named in the preceding para¬ 
graph are scaled variables. 

Scaled variables may be classified as to whether the scale is dis¬ 
crete or continuous. Thus age and height are continuous variables 
because they can take any value — integer or fraction — along 
the scale, which is continuous, without a break. On the other 
hand, number of siblings is a discrete variable. The values of 
the variable must be integers and cannot take any of the fractional 
values between the integers. 

Sample or Entire Population. The question as to whether ob¬ 
servations are to be made on all the individuals with whom the 
study is concerned (i.e., the population) or only on some sample of 
those subjects must be decided in each situation on its own merits. 
An example of an inquiry in which the goal is to make observations 
on all individuals is the United States Census. On the other hand 
when norms are to be established for a mental test, it is obvious 
that it is neither necessary nor economical to attempt to measure 


all children of the appropriate age in the country. Even if that 

could be done, the test would be used for children born at a later 

time and so not included in the original study. Clearly, the group 

on which test norms are established must be considered a sample 
of a larger population. 


instruments for making observations ” 

The procedures of sampling in experiments and in surveys are 
treated in several excellent textbooks, and are beyond the scope of 
Jiis book However, an indication of the general logic of sampling 

^'instruments forsaking Observations. If observations are to 
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Personnel of the Investigation. In some enterprises carried 
out by a small team and in many individual studies, the same per¬ 
son or persons determine the goals of the study, plan its design, 
and execute the details. However, in a governmental agency or 
in a large business enterprise, the purpose of the investigation is 
likely to be decided by higher administrative officials of the or¬ 
ganization, and the conduct of the investigation is likely to be 
turned over to technicians who work in consultation with the 
administrative officials. 

The planning of the investigation and the construction of in¬ 
struments require skill of very high order. Specialists in one or 
more subject-matter fields and in statistics participate in this 
aspect of the investigation. The actual use of the instruments in 
making observations may often be left to personnel with more 
limited training. Such personnel may have a variety of titles, 

• 

depending upon the area of work. In industry, they are often 
called inspectors. In census work, they are called enumerators. 

The statistical analysis of the data is carried on under super¬ 
vision of persons with advanced training in statistics. However, 
tabulations and computations may be performed by clerical work¬ 
ers and by operators of the various mechanical devices used in 
statistical work. The preparation of the final report is again likely 
to be an enterprise which is carried out by analysts of the highest 
skill. 

Source of Observations. In many statistical undertakings the 
observations are made specifically for the purpose at hand; in 
fact, the instruments are devised for the sake of the investigation. 
However, many other investigations use observations which have 
been accumulated as operating records of a public agency or a 
commercial organization. The problem of devising instruments is 
then minimized or even eliminated, but the problems of sampling, 
organizing data, analyzing, and reporting still remain. 

Original Records. For observations to be useful in a statistical 
investigation, it is necessary that they be recorded. Important 
considerations in the planning of forms for the original recording 
of data are: 

Accuracy of the record 

Simplicity of recording 

Identification of the subject 

Accuracy and ease of reading the record 

Simplicity of use in further analysis 

Completeness of record in respect to its later use 
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The time card in Figure 2-1 satisfies all the requirements here 
listed. Accuracy of the record is assured by an exact time mech¬ 
anism which automatically stamps the time. The record is made 
simply by inserting the card into the mechanism. The pressure 
of the card operates the mechanism. The employee is clearly 
identified. Since the record shows the date, hour, and minute 
stamped in a preassigned position, it is easy to read. The tabular 
arrangement simplifies computing time for pay-roll purposes. 


BROWN, JONES, SMITH & CO. 

TIME CARD 

Week beginning August 4, 1958 


Roe, Richard 
Packer 

Shipping Department 

Time Time 
in 

8:30 
8:31 


Mon. 

Tues. 

Wed. 

Thurs. 

Fri. 


out 

12:00 

12:00 


Time 

in 

12:55 

12:55 


Time 

out 

5:00 

5:00 


Fig. 2-1 Time Card. 


An IBM answer sheet for a test in multiple-choice form is 
shown in Figure 2-2. This answer sheet requires further process- 
irT An electric scoring machine will give the number of correct 
answers either for the entire sheet or for a specified portion of it. 

The examples which have been given provide a separa c form 
for the original record of each individual, but often a single form 
usTd for a group of individuals. The sheets used m census 
enumeration hL 40 lines on each side, all the data for one person 
being placed on a single line of that sheet. One advantage of sue h 
recording is that if an enumerator in the field handles only a few 
large sheets, he is less likely to lose one than if he handles many 

8I " The h Snal record form may also provide for recording sub¬ 
sequent action on the basis of the information. Bills customarily 
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have space for endorsement as having been paid. Applications 
for employment may also provide space for later evaluation, as 
approved or disapproved for employment. 
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2—2 IBM Answer Sheet for Multiple-choice Test. 

Summary Records. Frequently data on several variables for 
the same individual, gathered on separate original records, must 
be brought together on one single summary record. A familiar 
example of such a summary record is a teacher’s record book, 
where one line is allocated to each pupil and various columns to 
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THE PUNCHED CARD 

the tests The original test records from the answer sheets of the 
pupils are transcribed into the appropriate spaces in the class book. 
At the end of the school term the various test scores are average . 
The class book is also likely to have a space for a final grade. 

jr * 

“z -jsrst; "sss 

one card if t^e are a great many ^ by punching a hole 

this card any digit from 0 number consisting 

in the space identified with that digib o _ ^ column [m 
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each digit of the number. In designing i 
. c Jktvvj m, 

\nz~~' —r. . 


11 i 


r»»n 


I.' V M 


ij jpn jnlnn>>»>1 >> 1 > 11 " 1 ’ 1 ’ 11 1 „ J...,,i,hh|ihim«**"« 

• * I L.. . .1... ill J«*«««*««4««‘I* 4 i 4 4 4 4 t*" * J--.-J--- 

mi 












|£ gathering and recording data 

record of a very large amount of information, and the novice will 
find it a great economy to submit his instruments to such an expert 
before he gathers his data. Without expert advice in advance , one 
may incur much avoidable expense in transcribing data from the 
original instruments to the punched cards. In some of the col¬ 
umns, letters of the alphabet can be recorded, but these are used 
in reporting results, not in analysis. This type of card is used in 
connection with equipment made by the International Business 
Machines Company (IBM). 

The electric equipment can be used to sort cards into classes, 
to count cards, to list some or all the entries, and to make a variety 
of complex computations. Such equipment is expensive to rent 
or to buy and is customarily used only in an organization with a 
volume of work sufficiently great to warrant the expense. Card 
equipment which is much cheaper to buy will be described in the 
following paragraphs. 

Edge Marking Cards. Even if the size of a study does not 
justify the use of electric equipment, it may still be convenient to 
record all the data for one individual on a single card and to sort 
these cards by hand into piles according to the classes for a par¬ 
ticular variable. Even sheets of paper such as test blanks are 
sometimes sorted thus into piles. The mechanics of sorting is 
facilitated by placing easily recognized symbols in designated lo¬ 
cations on the cards. Thus, for example, a red check mark in one 
corner of the card might indicate a boy and a blue check mark a girl. 

The procedure of sorting can be further facilitated by making 
marks at designated positions at the very edge of the card. A code 
is drawn up showing exactly which of the numbered positions repre¬ 
sent the field for each variable and a record on the individual card 
is made by drawing a line from a numbered position to the edge 
of the card. 

Figure 2-4 shows the face of a Thurstone Edge Marking card. 
It is 5 by 8 inches, with 65 numbered positions on each side and 
space in the center for identifying data or written comment. There 
are many ways to fill out and use such a card. As a very simple 
illustration, suppose position 30 is to be used for recording the 
Yes-No answer to some question. For persons who say Fes, a 
black pencil could lie used to continue the short line in position 30 
to the edge of the card; for persons who say No, a red pencil could 
be used; for persons who fail to answer, the line could be left 
blank. . If a variable can take more than two values, a more com¬ 
plicated scheme is required. 
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EDGE PUNCHED CARDS 

After each card has been marked, the cards are stacked with 
all edges precisely even. Then by carefully riffling the cards, the 
markings for any one variable can be exposed in such a way that 
the cards can be very easily sorted into piles according to the 

categories for that variable. 1 



Edge Punched Cards. Such cards represent a mechanical im¬ 
provement over the edge marking cards in that they can be more 
efficiently sorted. Cards with a row of little holes around the edge, 
or sometimes two rows, can be purchased in various standard 
sizes as illustrated in Figure 2-5. If a study is to involve a very 
large number of individuals, it may be economical to have a card 
specially printed by one of the commercial firms handling such 

materials. , , „ . . . 

Data are recorded by notching the card, from a hole to the 

edge. A special clip must be used for this, as it is important that 

the notch be made smoothly and accurately. If a variable has 

only two values, a single hole can be assigned to that variable- 

then a notch may represent one value and the absence of a note 1 , 


i The Thurstone Edge Marking cards, together with a manual for theil‘ Use can 
be obtained from the Psychological Corporation, 522 tifth Avenue, New-Yo■*, N.V 
The content of thin manual originally appeared ^ a pa^r by L. 

Inurmil of the American Statistical Association, 43 (UM8), 4j1 •«**■ S<‘ •»'*» ' * 

marking of Statistical Cards” by A. M. Letter, Journal of the Amcncan Statistical Asso- 

cicition, 44 (1949), 293-294. 
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GATHERING AND RECORDING DATA 



Fig. 2-5 Example* of Edge Punched Card*. 


another. If a variable has more than two values, a field of several 

' :oles ma y have to be assigned to the variable. When cards of 

a certain category are to be selected, a needle is passed through 

the appropriate holes of the entire batch of cards. As the needle 

is lifted, the cards not classified in the category are draw’n out with 

the needle, the other cards which have been notched remaining 

behmd. More elaborate systems for complex classifications with 

edge punched cards arc described in the technical literature on this 
subject. 
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USE OF RECORDED DATA 

Use of Recorded Data. Gathering and recording data are the 
first steps in a statistical investigation, as they are, in fact, in any 
scientific enterprise. To make the recorded data useful to the 
investigation, it is necessary to select pertinent sections of the 
data and to analyze them in a variety of ways. The decision as to 
which data are to be selected for further analysis depends on the 
special purposes of the study. The remaining chapters of this 
book will be concerned with methods of analysis of data which 
are common to many statistical investigations. 



Construction and Use of 

Tables 


One of the simplest and most revealing devices for summarizing 
data and presenting them in a meaningful fashion is the statistical 
table. The table may be only an intermediate step leading to 
further analysis, or it may be an end product of the entire statisti¬ 
cal investigation. 

A well-planned table is a unified, coherent, and, in a sense, 
complete story about some aspect of a set of data. Elements of 
the data are set up in rows and columns so as to indicate impor¬ 
tant relationships. The story is carried by the title and by suitable 
phrases which describe the actual numbers in the rows and col¬ 
umns. The elements of the table and their functions will be de¬ 
scribed and illustrated in the following pages. 

The Table as a List. The simplest kind of table is a mere list 
of the subjects — usually in the vertical column at the left of the 
table — with certain facts about the subjects in the other columns 
of the table, all the facts for one subject appearing in the same 
horizontal line, or row. Such listings are illustrated in Tables 
VIII and IX in the Appendix. While in these tables the order in 
which the subjects are listed is of no particular importance, in 
many tables the subjects are placed in a meaningful order. Thus 
in a table giving data for the 48 states, the names of the states 
might be placed in alphabetical order for readers who wished to 
locate a particular state readily; in a historical study, the names 
might be placed in order of their admission to the Union; or, if 
the purpose of the table were to compare the states with respect 
to some characteristic such as density of population or per pupil 
expenditure for education, that characteristic might be placed in 
descending order. A list of candidates who have passed a civil 
service examination is usually arranged in the descending order 
of their examination scores because their relative position on this 
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determines their eligibility for 


ELEMENTS OF A TABLE 

list — their rank in the group - 
appointment to a position. 

If the data to be tabulated are on cards, with a separate card 
for each subject, these cards can be sorted in any desired order and 
then the data can be transcribed. If the original data already 
appear as a list, but a new list in some other order is desired, it 
may be helpful to prepare a set of cards, one for each subject, 
and then to sort these cards in the desired order prior to setting 
up the new list. A convenient feature of electric equipment for 
use with punched cards is that it makes possible an automatic 
listing, in tabular form, of the data on the punched cards. 

Summary Tables. A list is often a preliminary table, provid¬ 
ing a basis for other tables which summarize the information con- 
tained in the list and present it in a way which brings forward 
important facts and relationships contained in the data. The re¬ 
mainder of this chapter will be concerned with the methodology 

of constructing summary tables. _ . .... , . . 

The Elements of a Table. Table 3.1 is a simple table which 

summarizes and makes evident information contained in the list¬ 
ing of Table VIII about the sizes of Sections I, II. and III. Al¬ 
though it is a simple table, it contains the elements which char¬ 
acterize tables generally. These include: 

TABLE 3.1 Enrollment in Three Sections of o First Course in Statistical 

Methods 

Teachers College, Columbia University, February, 1950 


Section 


Number of students 


I 

41 

II 

27 

III 

28 

Not identified 

2 

Total 

98 




THE TABLE NUMBEli 
THE TITLE 


THE HEAD NOTE 


Table 3.1. 

Enrollment in Three Sections of a First Course in 
Statistical Methods. The title is a brief description 
of the contents of the table. 

Teachers College, Columbia University, February, 
1950. The head note may he used to provide addi¬ 
tional information about the fcentents of the table 
which would make the title undesirably long if in¬ 
cluded in it. 
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CONSTRUCTION AND USE OF TABLES 


THE FIELD 

THE COLUMN HEADS 

THE STUB 


LINE CAPTION 

8TUB HEAD 


The tabular entries, which in this table are the fre¬ 
quencies 4.1, 27, 28, 2 and 98. 

Section and Number of students. A column head 
describes the entries in a column. 

The left hand column, which contains the major 
scheme of classification of the tabular entries. In 
Table 3.1 the stub consists of the entries I, II, III, 
Not identified, and Total. 

Each entry on a line of the stub. Thus I, II, III, 
Not identified, and Total are line captions. 

A statement placed above the stub and used to 
describe the line captions in the stub. As the stub 
is a column, the stub head, Section, is also one of 
the column heads. 


The significance of these structural elements is more clearly 
brought out in Table 3.2 which will be discussed in some detail 
with reference to its structure as a table, and from the point of 
view of procedures needed to obtain the entries. 


TABLE 3.2 Number of Men and Women in Each Section of a First 

Course in Statistical Methods 

Teachers College, Columbia University, February, 1950 


Number of students 


Section 

Men 

Women 

Total 

I 

28 

13 

41 

II 

24 

3 

27 

III 

18 

10 

28 

Not identified 

1 

1 

2 

Total 

71 

27 

98 


Table 3.2 tells in greater detail the same story as Table 3.1. 
Because it shows another variable — namely, the sex of the stu¬ 
dents — two more columns are needed. There are now three 
columns in the field, and each column requires an explanatory 
heading. Instead of saying in this heading “Number of Men,” 
“Number of Women,” the phrase “Number of Students” is placed 
as brace heading (also called a spanner head) over all three columns. 
Note the use of a horizontal ruling to indicate the relation of the 
spanner head to the column heads. 

Tallying. The entries, or frequencies, in Table 3.2 could be 
obtained from the entries in the first two columns of Table VIII 
in the Appendix by counting all the entries for a given section 
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TALLYING 


which are unstarred in the first column, and then counting all the 
entries for the same section which are starred in the first column. 
This process could be repeated for all three sections. The results 
of these counts could then be checked against the complete counts 
for the sections made by ignoring the stars, as in Table 3.1, or 
by counts of all starred entries and all unstarred entries separately, 
ignoring sections. A check of some sort is absolutely essential 

for accuracy. . .... 

However, this method of obtaining the frequencies is laborious 

and likely to produce errors because it is so easy to lose count. 

It is better to obtain the frequencies by tallying. 

First, we note the nature of the breakdown desired for the 

final table, which, in this case, must provide for one classification 
into two groups (men, women) and for a second classification into 
four groups (three sections and a group of those not identified ). 
A diagram having 2 by 4 boxes or cells will be required and each 
cell must be large enough to hold all the tallies likely to be placed 
in it. Suitable headings should be placed on the rows and columns 
so that the person tabulating will not make a mistake by for- 


getting which is which. . 

Next, for each of the subjects in turn we enter in the appropri¬ 
ate box a short stroke called a tally (|). Thus for subject number 
1, a tally is placed in the box which is in row III and the column 
“Men ” The fifth tally in each set is drawn across the other lour 

IHI to facilitate counting the tallies. Sometimes the line for each 
tenth tally is slanted at a different angle to facilitate counting iy 
tens, thus: M W M IHf II- The diagram and the tallies which 

have been inserted in it are shown in Fig. 3 . 

Accuracy of tallying is considerably increased and eyestrain 
decreased if one person can read from the raw data while a second 
person enters tallies in the diagram. 


Fig. 3-1 Diagram Showing Tal- 
llei for Frequendet in Table 3.2. 


Section 

I 


II 


III 


Not 

Idontifiod 


III« 

M III 

M JHf 

III 

M M 1W.M 
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nr 

mm 

i 
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Table Containing Percents. Table 3.2 would be more informa¬ 
tive if the percents as well as the numbers of the men and women 
in the three sections were exhibited as in Table 3.3. Some distinc- 


TABLE 3.3 Percent of Men and Women in Each Section of a First Course 

in Statistical Methods 

Teachers College, Columbia University, February, 1950 


Section 

Men 

Women 


Both 

Number 

Percent 

Number 

Percent 

Number Percent 

I 

28 

68.3% 

13 

31.7% 

41 

100.0% 

II 

24 

88.9 

3 

11.1 

27 

100.0 

III 

18 

64.3 

10 

35.7 

28 

100.0 

Not identified 

1 

50.0 

1 

50.0 

2 

100.0 

All sections 

71 

72.4% 

27 

27.6% 

98 

100.0% 


tive structural features of this table merit consideration. The 
words “Men,” “Women,” and “Both” are used as spanner heads 
for brace heads ) and appropriate horizontal rules indicate the re¬ 
lation of each to the two column heads under it. 

The use of more than one kind of unit (numbers and percents) 
makes it imperative that the unit be named at the head of each 
column. To aid the eye in distinguishing which figures represent 
numbers and which represent percents, it is customary to place 
the % sign after the first figure in a column of percents and after 
the summary, if there is one. A similar convention applies to the 
use of the dollar sign. 

The introduction of percents requires identification of the base 
on which each percent is computed. The entries of 100.0% in 
the final column make it clear that each number in that column 
is the base of the percents in its row. Thus, = 68.3%, = 

31.7%, J-J- = 100.0%, and we have as a check 68.3%+ 31.7% = 
100.0%. Some authors record percents without indicating the 
numbers on which the percents are based. This practice is un¬ 
fortunate because it leaves the reader uncertain as to what sig¬ 
nificance to attach to the percents. Clearly 50 percent means 
something quite different when it is based on 200 cases than when 
it is based on 2 cases. 

In the final row of Table 3.3, the entries in the columns headed 
Is umber were obtained by adding the other entries in those 
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columns but the entries in the columns headed “Percent” were 
not obtained by adding other entries but by computing a new 
percent. Thus, 71/98 = 72.4% and 27/98 = 27.6%. Although 
the summaries for the columns beaded “ Number could properly 
be termed totals, those for the columns headed “ Percent cannot 
be so designated. To provide a heading appropriate for all entries 
the summary row has been labeled “All sections. 

EXERCISE 3.1 

1 From the data of Table 3.2, construct a table with percents based on 
the totals of the sexes instead of on the totals of the classes, as in 

2. From the same data, construct a table with column heads as follows. 


Number 


Percent 


Section 


Men Women Both Men Women Both 


TABLE 3.4 Number of Students in Each Section above or below the 

Middle of the Entire Group on Each of Two Tests 

A First Course in Statistical Methods. Teachers College, Columbia 

University, January, 1950 


Section 


II 


III 


All 

Sections 


Number of students scoring 


Sex 

On midterm test 

On arithmetic test 

Below 

53 

53 or 
higher 

Total 

Below 

33 

33 or 
higher 

Total 

Men 

Women 

Both 

11 

5 

10 

17 

8 

25 

28 

13 

41 

7 

3 

10 

21 

10 

31 

28 

13 

41 

Men- 

Women 

Both 

11 

3 

14 

13 

13 

24 

3 

27 

14 

2 

10 

10 

1 

11 

24 

3 

27 

Men 

Women 

Both 

12 

0 

18 

0 

4 

10 

18 

10 

28 

13 

8 

21 

5 

2 

7 

IS 

10 

28 

Men 

Women 

Both 

34 

14 

4K 

30 

12 

48 

70 

20 

90 

34 

13 

47 

30 

13 

49 

70 

20 

90 


Two cases with section unidentified have been on 


it led 
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A Table Containing Two Sets of Data. Table 3.4 is more com¬ 
plex than the preceding tables since it not only classifies the sub¬ 
jects by section and by sex, but also presents two sets of data on 
the same subjects — one set based on scores in a midterm test on 
the content of the course, and one on scores in a prognostic test 
in arithmetic given before the start of the course. The two sets 
of data are presented in a single table because the second set helps 
to explain the first. 

Column headings not only indicate the two separate parts of 
the table, they also manage to name the tabular entries in the 
table’s field. The stub is divided into four distinct blocks each of 
which designates a complete table, so that actually Table 3.4 con¬ 
sists of 8 subtables related in such a way that it is more useful to 
present them together than separately. 

Another feature of this table is that for the first time in this 
chapter use is made of continuous variables. These are the scores 
on the two tests. The choice of the two breaking points at 53 
and 33, respectively (noted in the column heads), may at first 
glance seem arbitrary. However, the entries in the final row of 
the table show that these scores divide the entire group into equal 
subgroups for the midterm test and into nearly equal subgroups 
for the arithmetic test. A procedure for finding such dividing 
scores will be described in Chapter 6. 

The frequencies in Table 3.4 were obtained by a simple sorting 
procedure which could be easily carried out by anyone concerned 
with similar data. For each individual, a small slip of paper was 
prepared and on that slip was recorded the individual’s code num¬ 
ber, section, sex, score on midterm test, and score on arithmetic 
test. Not all these data were used in the first tabulation but they 
were all recorded so as to be available for later tabulations. The 
slip used for Student 1 is indicated below 


1 III M 33 58 


The 9G slips of paper were filled out and checked for accuracy. 
I he slips were then sorted by midterm test score and arranged 
in descending score order. From this sort, it was ascertained that 
48 students scored 53 or above and 48 scored below 53. 

3 he. high (53 and above) and the low (below 53) subgroups 
were separated. Each subgroup was then sorted into six piles, 
by section and by sex within section. Obtaining the entries for 
the left-hand half of I able 3.4 was then merely a matter of count- 
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ing the numbers of slips of paper in the piles. The right-hand 
half of the table was constructed similarly, using the same 96 slips 

of paper. . . 

The analysis of the frequency distribution forms a major part 

of this book, and in effect the procedure just described was an 

elementary study of a simple frequency distribution. 

Although this chapter is concerned mainly with the structure 
of tables rather than with the content of any one of them, the 
content of Table 3.4 foreshadows so much of the reasoning in this 
book, and in the authors’ more advanced book on Statistical Infer- 
encc, that an analysis of this table seems desirable. An additional 
reason for the discussion which follows is that it gives an indication 

of how a table can be analyzed. 

First we may note that while the entire group is divided into 

equal subgroups by the score of 53 in the midterm test, thei same 

is not true of the Sections separately. In Section I three fifths 

the students scored 53 or higher, but in Section II fewer than half, 

and in Section III only a third-scored 53 or higher 

It is natural for the reader to wish to account for the differences 
among the sections. A first speculation might be that instruction 
was most effective in Section I and least in Section III. lion ever, 
before accepting this explanation, it is necessary to look at the data 
on the prognostic test of arithmetic. Interestingly enough lere 
also there are differences between the sections which parallel the 
differences observed in the other half of the Table. In Section I 
three fourths were among the highest half of the entire group 
but in Section II the corresponding fraction is two fifths, and in 
Section III it is only one fourth. It seems logical to assume that 
the observed differences in achievement of sections in the course 
may be partly or wholly accounted for by differences in mat 

matical ability of the students in these sections 

The relation between midterm score and arithmetic score for 

these 98 students, and the extent to which the former score can 

be predicted from the latter, will be taken up in Chapters 9 

aIld In°practice, data like those in Table 3.4 would be treated by 
more powerful statistical methods than those exhibited ... he 
table However, a tabulation like that appearing ... this table 
has various uses. It is helpful to the investigator in showing him 
direction for more extensive research. It is also helpful • 
playing to readers who are not technically trained the relation- 

ships discovered by technical means. 
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Summary of the Structural Elements of a Table. It seems de¬ 
sirable at this point to bring together in summary form a descrip¬ 
tion of the structural elements of the table. These structural ele¬ 
ments are illustrated in relation to Table 3.4 in Figures 3-2, 3-3, 
3-4, and 3-5. 



TABLE 3.4 Number of Students in Each Section above or below 
the Middle of the Entire Group of Each of Two Tests 

A First Course in Statistical Methods, Teachers College, 
Columbia University, February, 1950 


Number of students scoring 


Section Sex 


On midterm test 

Below 53 or Total 
53 higher 


On arithmetic test 

Below 33 or Total 
33 higher 



Footnote Two cases with section unidentified have been omitted 


Fig. 3-2 Major Parts of a Table. 


Such elements arc as follows: 

l. Heading. The portion of the table which includes the 
table number, the title, the subtitle, if any, the headnote, if any, 
the column headings, and column spanners. 

(a) Table number. If there are several tables in the same study, 
the tables should be numbered in sequence, either through the 
entire study or through each chapter. Reference to a table should 
usually be by its number. In referring to a table in the text, it is 
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better to say “In Table 15-” than to say “In the following 
table —" or “In the preceding table In preparing a manu¬ 

script the writer cannot be sure what will be the exact position of 
a table in the final printed version — whether it will actually fol¬ 
low or precede the reference - but its number is clear and un¬ 
ambiguous. 


Table 

number 

TABLE~3^4 Number of Students in Each Section above or below the j 

Middle of the Entire Group on Each of Two Tests j 

Headnote - A First Course in Statistical Methods, Teachers College, Columbia 

University, February, 1950 


Stub head —* Section Sex 


Number of students scoring 
On midterm test On arithmetic test 

v- y - 


Spanner 

head 




Column heads 


Fig. 3-3 Ports of the Heading. 


(b) Title. A clear description of the content of the table in 
as concise form as possible. It should help the reader understand 
quickly what the table is about. To begin the title with unnec 
essary words, such as “Table showing or “A comparison of 

the numbers of — 99 blunts its effect. . c 

M Subtitle A second title used to supplement a briefer main 

title Sometimes a table has a short display title and a longer, 

• Sometimes in <\ senes of similar 

more explanatory, main title, o , fusing 

subtitle and a headnote is not very impo which 

(d) Headnote. A statement or phrase below the tit c which 

serves to qualify the title or to provide information relating to 

rSbU al a whole. It may show units when alt e un,s m 

table are alike; it may show the source of the data. Wot every 

table needs a headnote. . • tivc title for a ll data in the 

column' whicTit heads."' Ideally, the column heading ”^ 
heading should name the tabular entries in the column. 
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Section 

Sex < 

i-Stubhead 

I 

Men 

Women 

<— Line caption 


Both 

<— Subtotal 

II 

Men 

Women 

Both 


III 

Men 



Women 

Both 

• <— Block 

All 

Men 


Sections 

Women 
Both * 

i-Grand total 

Fig. 3-4 Parts of the Stub. 


(f) Stubhead. The column head or caption of the stub, used 
to describe the stub listing as a whole. This is sometimes con¬ 
sidered part of the stub. 

(g) Spanner head. A classifying or descriptive caption spread¬ 
ing across and above two or more column headings, and applying 
equally to all columns thus covered. 

2. The stub. A description of the row classification appearing 
in the left-hand column. The stubhead is sometimes considered 
as part of the stub even though it is physically placed in the 
heading. 

(a) Line caption. A descriptive title for the basic data appear¬ 
ing on a line. As a line is also called a row, the line caption is also 
called row caption. 

(b) Subhead. A title which serves as a heading for several line 
captions. 

(c) Caption for summary line. A description of the group to 
which the summary applies. When the summary is merely the 
sum of several preceding entries, this caption is usually “Total” 
as in Tables 3.1 and 3.2. When, for a combined group, the sum¬ 
mary gives a percent or an average or similar measure not obtained 
by adding related measures for the component groups, as in Table 
3.3, the term “Total” does not apply, but instead some such 
phrase as “Combined group,” “All cities,” “All ages,” or the like 
is used. 

(d) Block. A segment of the stub consisting of several related 
line captions and subheads. 
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3. The field. The part of the table extending downward from 
the heading to the bottom rule of the table, and to the right of 
the stub. It contains the tabulated data of the table. 

(a) Cell. The space allotted for an entry. It lies at the inter¬ 
section of a column and a row. 

(b) Tabular entry. The information appearing in a cell ol 
the table. 


Column 


Fig. 3-5 



\ v / - 

6Ca !S 7Column. A vertical array of cells with a common classi- 

fiCa 4 i0r k footnote. Information needed for interpreting a table 
which is placed in a note directly under the table. This may^e 

information relating to the table as a w o e, su . { 

concerning its source. A footnote may furnish ani expWtmn o 
symbols used if their meaning is not obvious. WhenjipecrRc en 
tries in a table are explained by footnotes, reference s usuady 
made by means of such diacrit.cal marks as , , ^, or # or by 

means of small letters, a, b, c, . . . • ^ 

such reference because they might be mistaken for J “ 

tabular footnote does not become a par o ' e *= a ta ),i e 

footnotes used in the book or monograp . beneath its 

is not placed at the bottom of the page but directly beneath its 

‘^Tables with Averages as Entries. Tables 3.5 and 3.6 are pre¬ 
sented as examples of tables in which the entries are neither fre- 
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TABLE 3.5 Expectation of Life at Birth in the United States 


Year 




Expectation in years 





White 



Nonwhite 



All races 


Male 

Female 

Both 

sexes 

Male 

Female 

Both 

sexes 

Male 

Female 

Both 

sexes 

1900 

46.6 

48.7 

47.6 

32.5 

33.5 

33.0 

46.3 

48.3 

47.3 

1910 

48.6 

52.0 

50.3 

33.8 

37.5 

35.6 

48.4 

51.8 

50.0 

1920 

54.4 

55.6 

54.9 

45.5 

45.2 

45.3 

53.6 

54.6 

54.1 

1930 

59.7 

63.5 

61.4 

47.3 

49.2 

48.1 

58.1 

61.6 

59.7 

1940 

62.1 

66.6 

64.2 

51.5 

54.9 

53.1 

60.8 

« 65.2 

62.9 

1948 

65.5 

71.0 

68.0 

58.1 

62.5 

60.0 

64.7 

70,2 

67.2 

1949 

66.2 

71.9 

68.8 

58.9 

62.7 

60.6 

65.4 

71.0 

68.0 

1950 

66.6 

72.4 

69.2 

59.2 

63.2 

61.0 

65.8 

71.5 

68.4 

1951 

66.5 

72.4 

69.2 

59.1 

63.3 

61.0 

65.8 

71.5 

68.4 

1952 

66.6 

72.7 

69.4 

59.1 

63.7 

61.1 

65.9 

71.8 

68.6 

1953 

66.8 

72.9 

69.6 

59.7 

64.4 

61.7 

66.1 

72.1 

68.8 

1954 

67.4 

73.6 

70.3 

61.0 

65.6 

63.0 

66.8 

72.9 

69.6 


This table is taken from page 97 of the 1956 Life Insurance Fact Book , Institute 
of Life Insurance, New York, 1956. The Institute obtained the data from the 
National Office of Vital Statistics, Department of Health, Education and Welfare. 


TABLE 3.6 Expectation of Life at Various Ages in the United States 

1954 


Expectation in years 


Age 

White 

Nonwhite 

All 

races 

Male 

Female 

Male' 

Female 

0 

67.4 

73.6 

61.0 

65.6 

69.6 

20 

50.3 

55.9 

45.5 

49.4 

52.4 

40 

31.8 

36.7 

28.7 

31.8 

33.8 

45 

27.5 

32.2 

24.8 

28.0 

29.4 

50 

23.4 

27.7 

21.3 

24.4 

25.2 

55 

19.6 

23.5 

18.4 

21.3 

21.3 

60 

16.2 

19.4 

15.7 

18.3 

17.7 

65 

13.1 

15.7 

13.5 

15.7 

14.4 

70 

10.5 

12.4 

11.9 

14.0 

11.6 


This table is taken from page 97 of the 1956 Life Insurance Fact Book, Institute 
of Life Insurance, New York, 1956. The Institute obtained the data from the 
National Office of Vital Statistics, Department of Health, Education and Welfare. 
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quencies nor percents, but averages. Tables in which the tabular 
entries are standard deviations, percentiles, correlation coeffi¬ 
cients, or other summary measures occur in many scientific jour¬ 
nals and also in later chapters of this text. 

The figures in Table 3.5 are predictions, based on evidence 

available in the year indicated, as to what would be the average 
number of years in the life span for persons in the category in¬ 
dicated. Thus in 1900, it was predicted that for all white males 
born that year in the United States the average age at death 

would be 46.6 years. 

The figures in Table 3.6 are predictions, based on evidence 
available in 1951, as to how many more years, on the average, 
persons of a given age might be expected to live. Thus it is pre¬ 
dicted that for all white males, aged 50, the average number of 
years before death would be 23.0 and, therefore, that for these 
men who have already survived to age 50 the average age at death 

would be 73. 


EXERCISE 3.2 

1. Locate and name the structural elements in Tables 3.5 and 3.6. 

2. From Table 3.5 answer the following questions and indicate on what 
your answers are based: 

(a) By how many years did the average life span increase or white 
males during the period 1900-1954? Imr nonwhite males? Tor 
white females? For nonwhite females? Has the increase been 
greater for males or for females? For whites or nonwhites? 

(b) Was there any decade during this period in which the life expecta- 
tion was greater for males than for females. 

(c) Has the discrepancy between the average life span ° f ™ les “ d 
that of females increased or decreased during this period. Does 
your answer hold good for both whites and non whites. 

(d) Has the discrepancy between the average life span of 
nonwhites increased or decreased during this period? Does your 

answer apply both to males and to females. 

(e) Would it be correct to use the word Total as heading or^ the sum¬ 
mary columns here called Both sexes? As spanner heading instead 

of the phrase All Races? , . 

(f) Construct a new table in which the cloas.ficrxUons by by 

color are placed in the stub. Let the column headings b« l™, 
1920 1930 1940, 1950. Over these columns place a spanner head 
reading "Increase in life expectation during the decade beginning. 
From the given data make the necessary computations to obtain 

the tabular entries for this new table. 
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(1) In which decade occurred the greatest increase in life expects 
tion for white men? White women? Nonwhite men? Non¬ 
white women? 

(2) For all whites, which decade showed the greatest increase? 
For all nonwhites? 

(g) Was the rate of increase less in the decade 1940-50 than in the 
preceding decade? Is there some feature in Table 3.5 which might 
invite a careless reader to make a mistake on this point? 

3. Find in Table 3.6 data from which to answer the following questions: 

(a) What is the prediction as to the average age at death of all babies 
bom in the United States in 1954? Of all persons who are 20 years 
old in 1954? Of all persons who are 70 years old in 1954? 

(b) Is there any age at which the life expectation of women is less than 
that of men? 

(c) Is there any age at which the life expectation of white men is less 
than that of nonwhite men? Any age at which the life expectation 
of white women is less than that of nonwhite women? 

4. On a small slip of paper write for each student listed in Table VIII his 
number, his score in arithmetic, and his score on the midterm test. 
Sort these slips of paper into four classes to provide entries for the 
following tabulation: 

Arithmetic scores 
below 33 33 and higher 

53 and 
Midterm higher 

test - 

scores below 

53 

Construct a formal table using the tabulation just completed. Name 
the structural parts of the table. Discuss the significance of the con¬ 
tents of the table. 
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The use of graphs to make relationships among statistical data 
vivid is so well known that this chapter needs no justification. 
Some of the more common statistical graphs will be discussed 
here. Essentially, a graph is a representation of numbers by geo¬ 
metric figures drawn to scale. The effectiveness of a graph de¬ 
pends on the ingenuity with which relations among the geometric 
figures can be used to represent relations existing among numbers. 

The first consideration in drawing a graph, as in any under¬ 
taking, is its purpose. The most common purposes of graphs are 
representations of (a) numerical data, (b) partition of a total nto 
component parts, (c) changes over time intervals and(d ^elat.om 
ships among variables. Several purposes are often se. ved by 

“"The'second problem is the choice of geometric figures to repre¬ 
sent the numbers. Then the scale must be chosen F-naly the 
elements must be organized into the total graph. Jins organiz^ 

tion involves arrangement of the geometric g u ' > 

the scale, and formulation of a title. These problems will now 

considered briefly. . n u nvP the 

The Title. The title of a graph is sometimes p 
graph just as the title of a table is placed above the table, and >x 

sometimes placed below the graph just as ie i ^ j g l)e _ 

customarily placed below the picture. e or a „ pnc i cs the 
coming common with newspapers and advertising g •, 
latter still holds in most technical books and periodical n read 

ing either a table or a graph, one should 

then at such explanatory matter as t ie '° tabular cntr ics 

table or the scales of the graph, and , t j le tabular 

or the graph. In a table, the column 
entries; thus the title is most strate^i yp 
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In a graph, the scales may be above the graph, below it, or both, 
and so there is no clear reason why the title is uniformly better in 
one position than in the other. 

The Bar Graph. In this type of graph, numbers are repre¬ 
sented by bars drawn to scale. When the bars are in vertical posi¬ 
tion, the bar graph is sometimes called a column graph. The 
simple bar graph in Figure 4-1 provides a visual picture of the data 



Number of Students 


Fig. 4-1 Enrollment in Each Section of a First Course in Statistical Methods. Teachers 
College, Columbia University, February, 1950. 

in Table 3.1 on page 21. The scale is shown here below the graph. 
When there are many bars, it may be helpful to show the scale at 
both the top and bottom of the graph. Vertical guide lines com¬ 
bine with the horizontal scale to form a grid. Identification of 
the section to which each bar relates is placed at the left and 
corresponds to the stub of a table. 



Fig. 4-2 Number of Men and Women in Each Section of a First Course in Statistical 
Methods. Teachers College, Columbia University, February, 1950. 

In Figure 4-2 the bars as a whole are the same length as in 
Figure 4-1, but are subdivided to show the numbers of men and 
women. A limitation of this graph is that numbers of women are 
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not easily compared across sections, since the crosshatched por 
tions of the bars do not begin at the same scale value. 


Section 


I 


Fig. 4-3 Number of Men and 
Women in Each Section of a First 
Course in Statistical Methods. 
Teachers College, Columbia Uni¬ 
versity, February, 1950. 


II 


III 




10 20 
Number of Student* 

Women 


This limitation is removed in Figure 4 3 where the bars for 

both sexes begin at a common zero point am a ■ lx u'^pnts can- 
compared. However, in this graph, total section enrollments 

not be easily compared. .• 

If the purpose of the graph is to show only 
composition of the three sections, this purpose is better served y 
Figure 4-f in which the sexes are represented by pci cent o ). 



P#r c*nl 


Wom«n 

™ “““ ■ Eoch S .c,i„„ o. a Fi.st Cou.se in S,o.is.ic=l 

r: 1950 

A more 

pictures the data in Table 3.4. l ' - rather than actual 

numir::: have been set up to indicate the 





GRAPHIC METHODS 



percent of each subgroup in the upper or lower half of the entire 
group. This graph may be considered as a means of emphasizing 
the points made in the discussion which accompanies Table 3.4 
in Chapter 3. 



Arithmetic T#tt 



60 60 40 20 0 20 40 60 60 

Percent Perce*! 

b«low middle ikon middle 


Fig. 4-5 Percent of Students in Each Section Who Scored above or below the Middle 
of the Entire Group on Each of Two Tests. A First Course in Statistical Methods. Teachers 
College, Columbia University, February, 1950. 


A column chart used to represent similar data for different 
periods of time is shown in Figure 4-6. A group of three bars is 
shown for each year and two of these bars are divided. The original 
chart from which Figure 4-6 was copied used color most effectively, 
as many advertising media now do, but that color has not been 
reproduced here. 

Data dealing with the same variable at successive periods of 
time are called time scries. Several methods of graphing time 
series will be presented later in this chapter. 

Cautions. (1) All scale values from zero on must be repre¬ 
sented. One might have been tempted to begin the scale of Fig¬ 
ure 4-1 at 20 rather than at zero, but such curtailment would 
have given a misleading picture of the comparative enrollments 
of the sections. 

In a chapter entitled -'The Gee-Whiz Graph” in How to Lie 
with Statistics, Darrell Huff describes a graph designed to show 
a 10-percent increase in national income in a year. He com¬ 
ments : 


Now that’s dear enough. It shows what happened during the year 
and it shows it month by month. lie who runs may see and understand, 
because the whole grapli is in proportion, and there is a zero line at the 
bottom for comparison. Your 10 percent looks like 10 percent, an upward 
trend that is substantial but perhaps not overwhelming. That is very 
well if all you want to do is convey information. But suppose you wish 
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1946 


1947 


1948 


1949 


Source: 


I 7*tU • 

Fig. 4-6 Amounts Gained by ShareHolders.Em^ ^ ]<)5Q 

Financial Statement of the Standard 0,1 Company of New 

to win an argument, shock a reader, move: hin.into action sell him some- 

thing. For that this chart lacks schmaUz.Choy^ll ^ 

that’s more like it . . . nothing as *cn^ ^ ^ is a national-income 

sion that it gives. But what the las j ^ twc i vc months, all because 

line that has climbed halfway up u . h as become visually 

most of the chart isn’t there any more ... a small • 

a big one. 

Valid exceptions to the r ^ uir ® mCnt “nor ^'arefuf considera- 
at 0 are rare and should he a. o«c< on the chapters 

tion. Some appropriate exeept.ons are dcscr.beU 

on regression and correlation. values represented by the 

(2) Figures showing the numrwl v.ues «P « f the bars. 

bars should never be so placed as ° and closc to their ends, 

If such figures are placed out sic c relative length of 

they may mislead the eye by exaggeratmg 





GRAPHIC METHODS 


40 

the shorter bars, as in 



or at the left and sufficiently removed so that they do not appear 
to be part of the bar, as 

1507 


2321 



3016 


' > , .,/• / '/ V/, ' /'/ ’ * ■ '.V 

- - - - - - - ' - * - * - y. //. ... //. /. s. /t. /■ //' / 


The Pie Chart. This type of chart is used to show the partition¬ 
ing of a total into component parts. For example, Figure 4-7 
shows the proportionate enrollment, of three sections of a course in 
statistics. A very common use of the pie chart is to represent the 
division of a sum of money into its components. Thus the entire 
circle, or pie, may represent the budget of a family for a month, 

and the sections may represent portions of the budget allotted to 
rent, food, and so on. 

The pie chart is so called because the entire graph looks like 
a pie, and the components resemble slices cut from the pie. To 
make the chart, a circle with arbitrary radius is drawn to repre¬ 
sent the total. I he size of the radius depends upon available space 
and other factors of presentation. The components are then ap¬ 
portioned according to angular measure. The ordinary protractor 
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is based upon a scale in which the total circle is 360 degrees but 
it is possible to purchase a protractor in which the entire crchis 
divided not into 360 but into 100 equal parts so tha the tngfc 
representing any desired percent can be read directly E the 
protractor can be used to lay off the appropriate angles at the 

center of the circle, thereby setting up the components. 

Ordinarily the first radius is drawn vertically from the cente . 

The com — are then arranged in order in 

tion. If there is no natural order among the components, they 

are usually arranged in decreasing order of size. 


Fig. 4-7 Percent of Students in Three Sec¬ 
tions of a First Course in Statistical Methods. 
Teachers College, Columbia University, Febru¬ 
ary, 1950. 



The Line Graph. In this section ^^ daUat 

sidered as a means of presenting corr^ ^ of the line graph 
successive periods of time or * ^ relation8 hips between 

to represent frequency dist in later chapters, 

variables other than time v i * . 4 j The stub shows the 

Several time series are shown in * ^ wg a set 0 f statistical 

time period, and each of the ot ler co ^^ per iods. Such data 
data which has been changing \ ov ^ horizonta l scale shows the 
are called time senes. In Fig L Obviously, it is both 

time periods from the stub o * int 0Il this scale, 

unnecessary and impossible to s io\ expressed in units of 

The vertical scale shows number of w orke^ e P the zer0 

1,000,000. Here it is both possib.eand e^ent ^ ^ 4 _ 8 
point. Place a sheet of paper ov changec i. The grid consists 

and see how its meaning appears drawn as to facili- 

of a network of vertical and horizontal lines so 

tate construction and reading o * e g ocess 0 f plotting points 

Readers who are ncifam. att ention to the con- 

on coordinate axes should P > Table 4.1. For the 

struction of this chart, comparing it with Table 



42 


GRAPHIC METHODS 


TABLE 4.1 Old-age and Survivors Insurance 

(000 Omitted) 



Number of 

Number of 


living 

workers 

Year 

persons 

with 


with wage 

taxable 


credits 

earnings 


year-end 

in year 

1940 

44,900 

35,393 

1941 

51,000 

40,976 

1942 

58,500 

46,363 

1943 

65,400 

47,656 

1944 

69,500 

46,296 

1945 

72,400 

46,392 

1946 

75,000 

48,845 

1947 

77,100 

48,908 

1948 

79,200 

49,018 

1949 

80,600 

46,796 

1950 

82,400 

48,100 

1951 

87,700 

58,000 

1952 

90,500 

60,000 

1953 

93,000 

60,000 

1954 

95,200 

60,000 

1955 

101,700 

68,000 


Number of 
workers 
fully 
insured* 
year-end 


24.200 

25.800 
28,100 

29.900 

31.900 

33.400 

35.400 
37; 300 

38.900 
40,100 

59.800 
62,000 
66,600 

69.200 

69.800 
69,600 


Fully insured workers have at least 1 quarter of coverage (earned at any 

time after 1936) for each 2 quarters elapsing after January 1, 1951 (or after reaching 

age 2 if that is later) and before death or age 65, with at least 6 but never more 
than 40 quarters required. 

Adapted from the 1956 Life Insurance Fact Book, which acknowledges as 
original source, the Social Security Administration. 


year 1040, Table 4.1 indicates that 24,200,000 workers were fully 
insured. The chart conveys the same information by a dot placed 
directly above the point 1040 on the horizontal axis and directly 
to the right of the point 24,200,000 on the vertical axis. (Actually, 
the vertical scale cannot be read with complete precision and the 
numbers may be rounded to two places.) The reader should iden- 

.! y , ea . C 1 1 pomt 011 t,le 8 ra P h wllich represents a pair of entries in 
the table. 


" ‘ en * 1,10 S ra P h ,s to be drawn, the scales are laid out on 
axes at right angles to each other, and a grid is constructed or a 
read}-made grid is obtained by using cross-section paper. There 
is no necessary relation between the units of the horizontal scale 
and the units of the vertieal scale, and these are usually adjusted in 
such a way as to produce a figure which fits the page on which it is 
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i “:::s 

the manner described in the preceding paragraph 

"SEtfiEV rtii 

Th7i“'is; si.» 

1949 and 1950 there was . ^ spllrt , 

graph gives no indication of ^ # cliange in the social se- 

Sty law r which 0 admitted many additional workers to old age 

xtZT r£ 

son. In Figure 4-9 the number of vorka* pigurc 4 9 

compared with the number iay n K „*i 1()W until 1949 the number 

shows more vividly than the ta c oe. number 

cl Icily 1—rf «» «-£*£wort™ 

with taxable income but ain : are drawn on 

has been consistently greater. varied forms 

one chart, they can be 

The Semiloganthmic Graph la school thc 

interesting figures about enro rn ^ education for 6 

secondary school, and in institutions k 
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Fig. 4-9 Number of Workers with Taxable Income and Number Fully Insured at Year- 
end under Old Age and Survivors Insurance. 


TABLE 4.2 Enrollment in the Elementary Schools, Secondary Schools, and 
Institutions of Higher Education and the Entire Population of the United 

States at the Census Years, 1900-1950 


Year 

Elementary 

school 

Secondary 

school 

Higher 

education 

Entire 

population 

AT* 

Percent 
change f 

N* 

Percent 
change t 

N* 

Percent 
change t 

N* 

Percent 
change t 

1900 

16,262 


699 


238 


76,092 


1910 

18,529 

14% 

1,115 

60% 

355 

49% 

92,407 

21% 

1920 

20,964 

13 

2,500 

124 

59S 

68 

106,466 

15 

1930 

23,718 

13 

4,804 

92 

1,101 

84 

123,077 

16 

1940 

21,107 

-11 

7,123 

48 

1,494 

36 

132,122 

7 

1950 

22,202 

5 

6,427 

-10 

2,616 

75 

151,677 

15 


* In thousands 

t Change between two consecutive census years divided by the figure for the 
earlier of the two years. , 

The figures were taken from various editions of the Statistical Abstract of the 
United .S tales. There may be some minor inconsistencies because of changes in the 
manner of reporting. For example, the eighth-grade enrollments may sometimes 
be included in the figures for the elementary school and sometimes in the figures 
for the secondary school, depending on whether the organization is 8-4 or 6-3-3. 

successive census years. Figure 4-10 shows the elementary school 
enrollments, secondary school enrollments, and the entire popula¬ 
tion of the United States l>v a line graph of the type previously 
described. It was not feasible to show enrollments in higher educa- 
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Fig. 4-10 Elementary and Seconda y 
the Year* 1900-1950. 

tion on this chart because they distinguishable 

vertical scale that the line gr create s a general impression 

from the horizontal axis. 1 he ^ rapid i yi while school 

that the total population has be tual numbers such is 

enrollments have grown only Reference l0 Table 4.2 

the case, but certainly not in pc which are not reflected 

shows some very large percents of ioctc 

by Figure 4-10 in any satisfa y ^ equa l intervals on the 
In the scales discussed up t0 J ? the varia ble. Such scales 
scale represent equal amounts of cl ( ^ make use „f a scale in 
are termed arithmetic sea es. j percents of change m t ic 

which equal interva s rep ^ j )e(J i oga rHhmic scales. . . , _ 

variable. Such scales are j representing time, is an i- 

!„ Figure 4-H the horizontal scale,esP ted by the same scale 

metic, each 10 -year g itlim ic, so that an increase of 

interval. The vertical scale is loga 
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10 percent in any variable would be represented by the same in¬ 
terval on the vertical scale, no matter whether it was an increase 
from 200 to 220, or from 3000 to 3300, or from 60,000 to 66,000. 
To understand this scale, the numbers 200 and 220 might be 
located on the vertical scale and the interval between them marked 
with two points on the edge of a piece of paper. Then if that piece 
of paper is placed so the first point is at 3000, the second point 
will be found at 3300. Now any other number may be selected, 
say 60,000, the piece of paper placed so the first point falls at this 
number, and the second point will fall to a number which is 10 
percent larger. Clearly equal distances on this scale do not repre¬ 
sent equal numbers of persons but equal rates of change. A graph 
in which, as in Figure 4-11, one scale is arithmetic and one is 
logarithmic is called a semilogarithmic graph. In Figure 4-11 the 
vertical scale is in tiers of equal height, four of which are shown 
wholly or in part on this graph. The lowest tier is used for num¬ 
bers less than 1,000,000, the next for numbers between 1,000,000 
and 10,000,000, the third for numbers between 10,000,000 and 
100,000,000, and the fourth for numbers of 100,000,000 to 
1,000,000,000. 

To draw a semilogarithmic graph, one buys semilogarithmic 
graph paper and then plots points in the usual fashion. On such 
a chart, a straight line indicates a constant rate of change over 
time; of two lines, the one with steeper slope indicates the greater 
rate of change. 

In Figure 4-11 the lines for elementary school enrollments and 
for the general population are nearly straight from 1900 to 1930 
and are also nearly parallel. We draw the conclusion that they 
were increasing at about the same rate and that the rate of increase 
was relatively constant. Reference to Table 4.2 confirms that 
conclusion. Between 1940 and 1950 both were affected — the 
elementary school enrollments much more severely — by the drop 
in birthrate during the depression j'ears. 

T he lines for enrollments in secondary school and higher educa¬ 
tion look roughly parallel from 1900 to 1930, but if you will meas¬ 
ure the vertical increase for each between these two years, you 
will see that it was greater for the secondary school, indicating a 
greater percent of increase there. However, after 1930, increase 
proceeded more rapidly in higher education. Measure the vertical 
change for higher education over the 50-year period. You will 
see that the line lias climbed more than the "width of one complete 
tier, indicating that enrollments were more than ten times as great 
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question. Thus the ^"' carrying rakes, the number 

might be represented by a r ^ bricf cascSi etc. Each 

of office workers by a row . r of wor kers. Although 

figure would stand for a speci ^ they enjoyed a period 

pictographs have been in use fo b 



GRAPHIC METHODS 


48 

of great popularity in the 1930’s under the influence of Otto von 
Neurath of Vienna who had the idea that they could be developed 
into a universal language of science. To the person not familiar 
with graphs or with statistical ideas, the pictograph is probably 
more interesting than the undecorated bar graph or the line graph. 
However, the modern trend is to create interest through the use 
of color and of background pictures and the number of good picto- 
graphs published today is much smaller than it was twenty years 
ago. The pictograph has several disadvantages over other types 
of graph: (1) Drawing little symbols takes some inventiveness; 
they must be very simple and must depend more on outline than 
on detail; different symbols used in the same chart must be easily 
distinguished. (2) The construction of a bar graph or a line 
graph is easier except for large agencies which can have a symbol 
reproduced by a rubber stamp. (3) The pictograph is inflexible 
and suitable only for very simple data; the data of Figures 4-8, 
4-9, and 4-10 could not be effectively presented by a pictograph. 

Role of Graphs in Presentation of Data. While this chapter 
has described a variety of methods for presenting data graphically, 
it is not the intention of the authors to suggest that all, or even 
nearly all, data should be so presented. The preparation of graphs 
consumes a great deal of time and effort. Graphs also occupy a 
great deal of space when included in a report. For these reasons, 
data should be presented graphically only when such a presentation 
makes an essential contribution beyond that which might be at¬ 
tained by reference to data in a paragraph or by presentation in 
a table. 

Several of the earlier graphs shown in this chapter deal with 
data which do not justify graphic presentation, since the mere pre¬ 
sentation of the frequencies involved makes the point with suffi¬ 
cient effectiveness. These graphs were constructed as a means of 
introducing the topic by use of simple materials. Figure 4-9 pre¬ 
sents an illustration of a graph which performs an essential func¬ 
tion by making relationships vivid in a situation in which tabular 
presentation would have been less successful. 

In deciding whether or not data are to be presented graphically, 
or in making a choice between diverse ways of graphing data, 
account must be taken of the audience to which the presentation 
is directed and the circumstances under which the presentation is 
to be made. Data which are easily comprehended by one group, 
when presented in tabular form, may require presentation in 
graphic form to a less sophisticated group. Graphic presentation 
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helps even a sophisticated group to see trends and relationships 
when the data are sufficiently complex. When it is expected that 
an audience will have little time to devote to the comprehension 
of statistical data, it is very helpful to devise graphic means for 
nrnhing the data readily comprehensible. 



Data on a Scaled Variable 
Tabulating and Graphing 


This chapter and the three which follow it will be devoted to prob¬ 
lems of summarizing and analyzing data on a scaled variable. The 
most elementary of those procedures, tabulating and graphing, 
will be the subjects of this chapter. 

Several of the frequency distributions in Chapter 3 relate to 
data classified in unordered categories. Thus in Table 3.1, stu¬ 
dents are classified in three sections. True, these sections are 
labeled I, II, and III, but the numerals are merely names and the 
sections might as well have been called A, B, and C. In other 
tables in that chapter, students were classified according to sex 
as well as according to class section; these are unordered classes. 

For a scaled variable, the values have a necessary order and 
numerical relation to each other. 

Frequency Distribution on a Scaled Variable. A tabulation 
of data on the scaled variable age is shown in Table 5.1. The 
original records in the decennial census were for age at the last 
birthday but here, for convenience of presentation, the original 
data have been grouped in larger intervals. Each interval except 
the highest is an interval of 5 years. No upper limit is stated for 
the highest interval and so it is called an open interval. Every 
individual — or subject — in the study is allocated to one of these 
intervals which we call class intervals. Thus the 150,697,361 per¬ 
sons enumerated in the 1950 census have been classified in the age 
groups named. These classes are called mutually exclusive because 
it is impossible for any one subject to belong to more than one 
of them. 

A set of mutually exclusive classes and the number of individ¬ 
uals belonging in each class constitute what is called a frequency 
distribution. The number of individuals in any one class is called 
a frequency and in this text will usually be indicated by the letter 
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TABLE 5.1 Population of the United States in 4 Census Years, Classified 

by Age 


Age 

in years * 


Number of persons t enumerated in the Census 


Total, all ages 
Under 5 
5-9 
10-14 
15-19 
20-24 
25-29 
30-34 
35-39 
40-44 
45-49 
50-54 
55-59 
60-04 
65-69 
70-74 

75 and over 
Not reported 



1920 


150,697 
16,164 
13,200 
11,119 
10,617 
11,482 
12,242 
11,517 
11,246 
10,204 
9,070 
8,272 
7,235 
6,059 
5,003 
3,412 
3,855 


131,669 
10,542 
10,685 
11,746 
12,334 
11,588 
11,097 
10,242 
9,545 
8,788 
8,255 
7,257 
5,844 
4,728 
3,807 
2,570 
2,643 


122,775 
11,444 
12,008 
12,005 
11,552 
10,870 
9,834 
9,120 
9,209 
7,990 
7,042 
5,976 
4,646 
3,751 
2,771 
1,9.50 
1,913 
94 


105,711 
11,573 
11,398 
10,641 
9,431 
9,277 
9,086 
8,071 
7,775 
6,346 
5,764 
4,735 
3,549 
2,983 
2,068 
1,395 
1,470 
149 


fss - - 7 . 

Ornately 

figure will give a good approximation to the origin 

. . • T a ble 5.1. Table 5.1 pre- 

/ unless it is named in words, as l 18 . u having the same 

Bents four different frequency distr.but.ons 

set of classes. • n r the contribution to sta- 

In order to understand somet ^"f ibution C an make, it may 
tistical thought which a frequen y entries in Table 5.1. 

be worthwhile to take a closer oo down that 

Look first at the figures for 1920 Lett mg the ey^ ^ ^ 

column, it will be noticed that e e ce( ]j n g 5-year period, 
selected is smaller than the entry or .. . at a given age de- 
In other words, the number of persons living & ^ conse . 

creases as age increases. This P ie f rcnU ency for the open 
‘quence of mortality. Of course, J for the previous 

period “75 and over” is larger than the requen y covers 

period, but this is understandable since 7o 

more than a 5-year period. , . same steady de- 

None of the three later censuses shows 
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crease down the entire column. In 1930 there are fewer children 
under 5 than in the age group 5-9, obviously reflecting a lowered 
birth rate in the late 1920’s, but after that youngest group, the 
number of persons living decreases as age increases. 

The 1940 figures have been affected also by a second decrease 
in birth rate during the depression in the 1930’s, and so we have 
the remarkable phenomenon of the number of living persons actu¬ 
ally increasing over the first four age groups. The sharp rise in 
birth rate at the close of World War II is reflected in the 1950 
figures in the large frequencies for the two youngest age groups. 
The three groups between ages 10 and 25 are still showing the effect 
of the lower birth rate in the earlier decades. 

Reading across the table, comparing frequencies in the same 
row, is also instructive. Although the total population of the 
country increased every decade, the number of children under 5 
decreased from 1920 to 1940 not only in proportion to the popula¬ 
tion but in actual numbers. The entries in rows near the bottom 
of the table show striking evidence of our aging population. The 
number of persons aged 75 and over has increased much faster 
than the total population. The total population increased about 


43 percent from 1920 to 1950: 


150,697 

105,711 


1.43; while the number 


3 855 

in this oldest age group increased about 162 percent: = 2.62. 

Construction of a Frequency Distribution. The frequency dis¬ 
tributions in Table 5.1 were presented to us in completed form, and 
our role in regard to them has been more or less passive. But sup¬ 
pose we had the active task of constructing such a distribution, 
what problems would we face? It has already been said that a 
frequency distribution is the result of assigning each individual in 
the group studied to one of a set of mutually exclusive classes. 
Then, obviously, the problems are the selection of classes and the 
allocating of individuals to them. 


The matter of allocating individuals to classes has already been 
discussed in Chapter 3 and need not be treated further here. For 
the census data we have been discussing, an IBM card was punched 
for each person and the cards were sorted and tabulated electri¬ 
cally, according to the age intervals or classes shown in Table 5.1.* 
The number of cards in each age interval is the frequency in the 
interval. 


We shall give some attention to the manner in which the in¬ 
tervals of a frequency distribution are set up. To accomplish this, 
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it will be necessary to examine closely the nature of a scaled vari¬ 
able and the way in which it can be divided up to provide appropri¬ 
ate classes for a frequency distribution. . ,, _ 

Discrete and Continuous Variables. Some scaled va.rialbl » 

such as the number of children in a family the number of books 
loaned by a library in a week, the number of automobiles sold by 
a dealer fn a spebified period - can take values on'y ^ “rtam d - 
Crete points on a scale. Such variables have already been te.med 

discrete variables. . — 

Other scaled variables - such as age weight or temperat 

are termed continuous because they can have values at everj pm 

on a continuous scale. .„ oc nf o con- 

Obviously not all the infinitely many actu ^ ^ confront . 

tinuous variable can be reported. One •-kip to decide in 

i„ e . p,„ *«,o e ..h. ; <*. o. h ” 

advance what values he will repoit. , c j 1 ii ( j w ho 

might decide to report no fractions of a pound, 

-r i “’so 1 ” lb .rJ-....... 

such values as 51.5, 52.0, 52.5, 5 . , 

Va ‘Foi rh h e e ir.yt 2 prrdJr^vhich 

continuous variable will be 

Pre Th t etale of a Continuous Variable. In order 

quency distribution of a continuous 7^^.^ tQ undcrstal id the 

computations with scaled data, • variable 

representation of scores on the scale of such a enable. 

Ag# to last birthday 
Aga to ntareifc birthday 

i • 'Tohip 5 1 ace was reported to 
For the census data shown m ons ’ reported as 16 years 

the Inst birthday. Thus some o P , , ile ot liers are 

old have barely passed their “f 0 * b, J^ tinuolls variable 

on the verge of their seventeen 1 . • ig a d j sc rete variable, 

but age reported by the census scale marked off in 

and only integers are reported. It an ag 
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units of 1 year, for the census data or any data recorded as “age 
in years at last birthday,” the single number 16 represents all 
ages in the interval from 16 to 17 with midpoint at 16.5. If age 
is reported “to the nearest birthday,” the number 16 represents 
the interval from 15.5 to 16.5 with midpoint at 16.0. Many 
people would find great difficulty in deciding which is their nearest 
birthday and, therefore, many studies of age call for the last 
birthday. Because both methods of reporting are in use, there is 
considerable ambiguity in age data unless an author tells exactly 
what a unit means. A column head should never be simply “age” 
or even “age in years,” unless somewhere in title or subhead or 
footnote there is an explanation of where the year begins. 

A year is sometimes indicated by its end point, as when a small 
child says he is “going on four” or an old man says, “I will be 95 
at my next birthday.” Because we name centuries by their end 
points, the century in which we are now living is called the twen¬ 
tieth. If a questionnaire item reads, “How many years have you 
been in your present position? Include the present year,” each 
answer would represent a year interval named by its end point. 
Thus an interval named 6 would extend from 5 to 6, with midpoint 
at 5.5. The foregoing discussion is intended to emphasize the im¬ 
portance of thinking carefully about the meaning of scale units 
and giving the reader adequate information for interpreting them 
correctly. 

For most scaled variables other than age, it has become cus¬ 
tomary to name a unit by the value of its midpoint. The width 
of the unit in which measurement is made indicates the degree of 
precision of the measurement — the narrower the unit, the more 
precise the measurement. Thus in measuring lines to the nearest 
inch, the line OA will be reported as 3 inches long; in measuring 

to the nearest half inch, as 3; in measuring to the nearest quarter, 
as 2J. 
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l 

J 

To nearest inch 

0 

1 

2 

. i 

3 

. i 

4 

. i 

To nearest half inch 

0 

1 _L_ 

1 

_!-» I 

2 

...1. 

3 

i.l. 

4 

i . t 

To nearest quarter inch 

0 

1 

2 

3 

4 


0 

Scale 

: * inch 

= 1 inch 

A 



In measuring to the nearest inch, the number 3 represents a unit 
1 inch wide extending from 2-J to 3*, with midpoint at 3. 


GROUPING SCORES INTO WIDER INTERVALS 55 

In measuring to the nearest half inch, the number 3 represents 
a Jit one half rnch wide, extending from 2f to 31 with midpoint 

^ 3 In measuring to the nearest quarter inch, the number 3 repre¬ 
sents a unit one quarter of an inch wide, extending from 2* to 31, 

with midpoint at 3. 

EXERCISE 5.1 

1. w»— - • “fsrs.-s'srt- 

reporting measurements are uipv of & ^ on the scale, exactly 

sidered as representing the midp labeled 7? How wide 

what portion of the scale is covered by the unit labeled 

is the unit? 

(a) 5, 6 , 7, 8 , 9, 10, 11, 12 

(b) 3, 5, 7, 9, 11, 13, 15, 17, 19 

(c) 5.5, 6.0, 6.5, 7.0, 7.5, 8.0 

(d) 4, 7, 10, 13, 16, 19, 22 

(e) 6 . 8 , 6.9, 7.0, 7.1, 7.2, 7.3 

(f) 5|, 6 , 6 i, 6 i, 0}, 7, 7J, 7* of precision 

2. Suppose a continuous variable w nting five successive scale 

shown below. Write five numbers reprei* mng^ 

units, and let 20 be the second of the five nu 

(a) 0.5 Answer: 19.5, 20.0, 20.5, 21.0, 21.5 

(b) 0.1 • 20 0 • 

(c) 3.0 • 20 0 * 

(d) 0 01 • 20.0 

Grouping Scores into Wider Intervals Jin into 

ages originally recorded m units of 1 ^^;^^ ^ ^ ^ ? g> 

5-year mtervals. Thus the age g labe i e( ] 5 - 9 . From the 
or 9 have been combined into an in that t h e interval 5-9 

original meaning of the year um , fi f t h birthday but 

includes ages of children who have p • 15 _ — 



not their tenth and so on a linear scale it 

width 5 extending from 5 to 10 wi ™ an j n terval 5-9 would 
For almost any other type of scaled ’ 4 i t0 9| with mid- 
represent an interval of width 5 extending from 4* 

point at 7. 
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4 5-9 10-14 


Examine the preceding sketch in which a scale unit is named 
by its midpoint, and look at the interval which includes the five 
units, 10, 11, 12, 13, and 14; it has been named 10-14 and these 
numbers are called its score limits. However, the interval actually 
extends from point 9.5 to point 14.5, and the interval limits 
9.5-14.5 are called true limits or real limits. The midpoint of 
this interval is at 12, which is halfway between the real limits, 
J(9.5 -1- 14.5) = 12; and is also halfway between the score limits, 
4(10 + 14) = 12. This midpoint is the class value, or the class index, 
or the class mark. The size of the class interval, or the width of the 
interval, is the difference between the real limits, 14.5 - 9.5 = 5.0, 
and shows the degree of precision of the measurement. It is com¬ 
monly denoted by the letter i (for interval). The class interval is 
also called the step interval or the class sort. Study the relation 
of these various terms as illustrated for several successive in¬ 
tervals: 


Scores included 

Real 

Score 

Class 

• 

t 

in the interval 

limits 

limits 

index 

20, 21, 22, 23, 24 

19.5-24.5 

20-24 

22 

5 

15, 10, 17, 18, 19 

i 1.5-19.5 

15-19 

17 

5 

10, 11, 12, 13, 14 

9.5-14.5 

10-14 

12 

5 

5, 0, 7, 8, 9 

4.5-9.5 

5-9 

7 

5 


Some authors prefer to name an interval by its real limits. 
However, that practice seems to invite mistakes in tabulation, as 
the person tabulating might place the number 14 in the interval 
marked 14.5-19.5 instead of in the one marked 9.5-14.5 where it 
belongs. This book will follow the practice of naming an interval 
by its score limits. 


A careless mistake often made by beginners is to underestimate 
the width of interval by taking the difference between score limits 
instead of real limits. For the data we have been discussing, this 
mistake would result in saying i = 14 - 10 = 4, or i = 9 - 5 = 4. 
Look again at the sketch of the scale to see what parts of that scale 
«) - 5 = 4 and 14 — 10 = 4 cover and to see exactly why i is not 4 
bul 5. The width of interval is, of course, also the difference 
between successive class indices which are 7, 12, 17, and 22. 
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EXERCISE 5.2 

1. Draw a scale and mark on it the score points 6.0, 6.5, 7.0, 7.5, 8.0, 

(a) ’ Draw a brace to represent the interval which includes scores 6.5, 

7.0, and 7.5. 

(b) Fill in the blanks below: 

Scores in Real Score Class { 

interval limits limits index 

9.5, 10.0, 10.5 
8.0, 8.5, 9.0 

2 . H Jch sol is an integer, and real limits are as indicated, fill in the 
blanks: 

Class 
index 


(a) Score 

Scores in 

Real 

limits 

interval 

limits 

17-19 


—— 

14-16 

— 

— 

(b) Score 

Real 

Class 

limits 

limits 

index 

30-39 

— 

— 

20-29 

— 

— 

10-19 

— 

- - 

(c) Score 

Real 

Class 

limits 

limits 

index 

70-76 

- -- 

— 

63-69 

— 

-- 

56-62 

— 

— 


56-6/ ‘ , , in ■■>, onf i qo-34 

3. If the score limits of two what is the 

what arc the score limits of the nexi b 

value of t? . 

. T . The choice of interval depends partly 

enumerators in units of 1 >ear, th or 3 months, 

possibly be made with mterva s5 °’ kcr t0 consider, before 
nr 18 months. It behooves any research _ 


J/UOOIUIJ UL HIC4V-*V^ - . 

or 18 months. It behooves anyResearch ^ ^ ^ thc 

he gathers data, the use to w hic . ef tabulations, 

degree of precision which will be dcsira > ‘ eoin| ,utation, 

If considerable precision is required in ana * °to 1 
nothing is gained by reclassifying subjects into mtei 
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than the units in which observations were first recorded. If great 
precision is not required, the use of larger intervals may simplify 
computation and clarify analysis. The presentation of a fre¬ 
quency distribution in a report is more effective if it can be so 
planned that the number of intervals is not top great and not too 
small. When the number of cases is small and the number of 
intervals is large, the distribution is likely to be irregular or ragged. 
By grouping cases in larger intervals or by obtaining a larger num¬ 
ber of cases, the form of the distribution may be presented more 
vividly. However, the intervals can be made so wide that much 
of the available information is lost. If the number of intervals 
were reduced to the absurd extreme of having only one interval, 
we would lose all information about the form of the frequency dis¬ 
tribution. In general, for purposes of effective presentation of 
data, it is a good plan to use between 10 and 20 intervals. 

Combination of an odd number of units into an interval has 
the slight advantage of making the midpoint of the interval fall 
on one of the scores, whereas combining an even number makes 
the midpoint fall halfway between two scores. However, an in¬ 
terval of 10 units is generally preferred to one of 9 or 11 partly 
because tabulation by tens seems easy and natural. Intervals of 
3, 5, or 10 units are most often used in reports. 

The Cumulative Distribution. An important aid to interpret¬ 
ing a frequency distribution is the cumulative distribution which 
will now be illustrated from the 1950 and the 1920 data of Table 
5.1. In Table 5.2 the two columns labeled frequency are identical 
with the corresponding columns in Table 5.1 except for the omis¬ 
sion of the row for “all ages” and the omission of the frequency 
for which age was not reported in 1920. Let us concentrate atten¬ 
tion on the 1950 data. An entry in the frequency column shows 
the number of persons in the United States of the ages indicated 
in the given interval; an entry in the cumulative frequency shows 
the number of persons of those ages or younger. The entries in 
the cumulative-frequency column are obtained by the successive 
addition of the entries in the frequency column, beginning with 
the frequency in the lowest interval on the scale of age. Thus the 
cumulative frequency in the interval 5-9 is 10,164 -f 13,200 = 
29,304 and means that the number of persons who are 9 or younger 
— that is, persons who have not passed their tenth birthday — is 
approximately 29,304,000, to the nearest 1000. The next cumula¬ 
tive frequency is 29,304 -f 11,119 = 40,4S3, and so on. 

Lntries in the cumulative-percent column are obtained by ex- 
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table 5.2 Cumulative Distribution of the Portion, of the United 

States in 1920 and 1950, Classified by Ag 


1950 Census 


1920 Census 


Age* 
in years 


Fre¬ 
quency t 


Under 5 
5-9 
10-14 
15-19 
20-24 
25-29 
30-34 
35-39 
40-44 
45-49 
50-54 
55-59 
60-64 
65-69 
70-74 

75 and over 


16,164 

13,200 

11,119 

10,617 

11,482 

12,242 

11,517 

11,246 

10,204 

9,070 

8,272 

7,235 

6,059 

5,003 

3,412 

3,855 


Cumula¬ 
tive 
fre¬ 
quency 

16,164 

29,364 

40,483 

51,100 

62,582 

74,824 

86,341 

97,587 

107,791 

116,861 

125,133 

132,368 

138,427 

143,430 

146,842 

150,697 


Cumula¬ 

tive 

percent 

10.7% 

19.5 

26.9 

33.9 

41.5 

49.7 

57.3 

64.8 

71.5 

77.5 
83.0 

87.8 

91.9 
95.2 

97.4 

100.0 


Fre¬ 
quency t 


Cumula¬ 
tive t 
fre¬ 
quency 


11,573 

11,398 

10,641 

9,431 

9,277 

9,086 

8,071 

7,775 

6,346 

5,764 

4,735 

3,549 

2,983 

2,068 

1,395 

1,470 


11,573 

22,971 

33,612 

43,043 

52,320 

61,406 

69,477 

77,252 

83,598 

89,362 

94,097 

97,646 

100,629 

102,697 

104,092 

105,562 


Cumula¬ 

tive 

percent 

11 . 0 % 
21.8 

31.8 

40.8 

49.6 

58.2 

65.8 

73.2 

79.2 

84.7 
89.1 

92.5 

95.3 

97.3 

98.6 
100.0 


source: United States Census Report for 1950, VoL H. ™ 1 

* To the last birthday 
t Rounded to the nearest 1WU reported 

t including those for whom age was ^ rf 

pressing the corresponding ™mulaUv ( ^ ^ ^ ccnsus 

the total. Thus for the interval 20 24, 

62,582 _ indicating that in 1950, 41.5 perce 

150,697 ' ’ T he corresponding figure for 

tion were 24 years old or young 

. 52,320 _ 496 . 

the 1920 census .s 105 562 ' ison 0 f the 1920 and 

The cumulative columns ; re different, this com- 

1950 populations but, because „ made after the cumu- 

parison can be most easily and mean ^ ut your eye 

lative frequencies have been re t columns, comparing 

travel between the two cumu a percent is always con 

the two figures for the same in er • t i, a t regardless of what 
siderably larger for the year 1920, • r than that age was 

age we choose, the percent of perso 1 
greater in 1920 than in 1950. 
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The figures in Table 5.2 have been cumulated upward, be¬ 
ginning with the lowest interval on the scale — that is, the interval 
for the lowest age. It may be equally interesting to cumulate 
downward, beginning with the interval in which age is highest. 
According to census custom, the “lowest” interval in this sense 
is at the top of the page. Thus in the two upper intervals for 1950, 
the sum of the frequencies (in thousands) is 3,855 + 3,412 = 
7,267; in the three upper intervals, it is 7,267 + 5,003 = 12,270, 
etc. From such a cumulation we could learn that in 1920 the per¬ 
cent of the population 60 years or older was 7.5 and in 1950 that 
percent had risen to 12. 

Graphs of the Frequency Distribution. While there are many 
different ways of picturing a frequency distribution, certain de¬ 
vices contribute especially to the general understanding of the 
statistical measures which will be discussed in later chapters. 
These devices are the histogram, the frequency polygon, the cumu¬ 
lative-frequency graph, and the cumulative-percent graph. 

It will be convenient in discussing frequency 
graphs to use the conventional mathematical terms 
p ordinate and abscissa. When a point P is located 
in reference to two axes at right angles to each 
1 other, the vertical distance from the horizontal 
1 axis to the point is called the ordinate of that point, 
and the horizontal distance from the vertical axis to 
— the point is called its abscissa. Together the ordinate 
and abscissa are called the coordinates of the point. 
The term abscissa originated in medieval surveying manuals 
written in Latin, in which that portion of a horizontal distance cut 
off by the shadow of a vertical staff was called the pars abscissa or 
the “part cut off.” 

1. The Histogram. Table 5.3 has been prepared from the data 
presented in Appendix Table IX. To prepare a histogram from 
the frequencies in Table 5.3, 

lay off the scale of scores on dm.™ point. 7y* ' io/ 2 ' 13/, ' i6/ a 
a horizontal axis, the scale 

increasing from left to right. Midpoint. ’ H 9 1 12 1 is"” 

This scale must be marked in 

such a way that the division points between intervals can be 
identified. With infinitely many points on the scale to select 
from, a few must he chosen to mark for identification. The most 
common methods of marking the scale are to show the values of 
the division points between intervals or the values of the mid¬ 
points of intervals as in sketch above. 


abscissa 

— n 
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TABLE 5.3 Scores on Modern School Achievement Test of Reading 

Speed Made by 109 Fourth-Grade Children 



Score / Cumulative/ 


Cumulative 

percent 



It is most important to label the scale of scores properly. 

At right angles to the scale of scores, erect a scale of ordinates. 

Sr.2:3£ - - 

vertical height “ ”* eh Ireqae.e, .hit I. 

sett r “£ £ 

column outlines need be drawn. Uttcn even 

tween columns are omitted. . . j appears to be satis- 

m Figure 5-1 the fj^ncy m ,an m ^ ^ J ^ ^ ^ ^ 

factonly represented either by situations (the fre- 

However, hi later ™ ",”1 di.tribatia. to Chapter 

ordinate but only by the area >e we ronresented by area, 

to form the habit of thinking of requenc compare two 

2. The Frequency Polygon . If we the 

distributions by presenting two is ograms difficult 

result would probably be a confusing mass of vertical 
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Fig. 5-1 Histogram: Distribution of Score, of 109 Fourth-grade Children of .City X on 
Modem School Achievement Test of Reading Speed. (Based on frequency dr.tribuhon 

of Table 5.3.) 


to identify. The picture is much easier to interpret if frequency 

polygons are used instead of histograms. 

For a frequency polygon, the horizontal and vertical scales are 

laid off exactly as for a histogram. For each interval, a point is 
located directly above the middle of the interval so that its vertical 
distance represents the frequency in that interval measured on the 
frequency scale. In other words, the abscissa of the point is the 
score at the middle of an interval and its ordinate is the frequency 
in that interval. Note that if an interval has zero frequency, the 
point representing that frequency will lie on the horizontal axis, 
and also that there is always such a point at each end of the dis- 



Fig. 5-2 Frequency Polygon Superimposed on Histogram. (From data of Table 5.3. 
Compare with Fig. 5-1.) 
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tributioR. The points thus located are connected by straight lines 

t0 Th“ relatiorfbetw^en the histogram and the frequency polygon 

can be se nt Figure 5-2 where they have been graphed on he 
same axes Examination of this figure shows that the areas of the 

two figures are exactly ^"aily matched 

i k..i. -r "* 0 ^ U 

areas but not uniformly equal 0r " S th ®® by ordina te in these 
more satisfactorily represented by “ aa 1 y , is re - 

... «gu re ,. s.,™. .1.. 

U called the .ienaiftf cumulative chart, the 

3 . The Cumulate as before and a vertical 

horizontal scale is laid off freauency (or cumulative per- 

scale is drawn to sho * much a g for the frequency polygon 

cent). Points are plott J frequency or percent instead of 
except that (a) it is cum ^ vertical sca i e> an( j (b) the 

interval frequency thei is pi- ^ middle q( aR interval but above its 
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The graph of a cumulative curve is usually a reflex curve with the 
center of curvature lying to the right in one portion of the curve 
and to the left in another portion. In architecture, an ogive arch 
(also called ogee) is one which has a reflex curve. Because cumula 
tive-frequency or percent graphs are usually reflex, Francis Galton 
named them ogives and the name is still used for this type of pres- 
entation even when the graph is not reflex. 


EXERCISE 5.3 

1. Make a graph for the cumulative-percent columns of Table 5.2, placing 
the data for 1920 and 1950 on the same axes. You will notice that these 
cumulative graphs do not have the usual ogive form. 

2. Construct a table showing the frequency distributions of the artificial 
language scores for the three sections from the data of Table VIII in 
the Appendix. First, select an appropriate interval. As the highest 
score is GO and the lowest is 14, an interval of 3 units would give 16 
or 17 intervals, while one of 5 would give 10 or 11 depending upon where 

the interval starts. 

3. From the above table, construct three frequency polygons on the same 
base line. It might be noted in passing, how confusing it would be to 
try to present three histograms on the same base line. 

4. From the table called for in Question 2, construct and graph the three 
cumulative-frequency distributions. 


Percentiles 


[ere and in the following 
^WSn/the relative 

S ““ or^he 3^ - -res in the distribution of 

cores. ... ... , tHp Kqgi(* icIcas. A count 

Some very simple data will dh»the basic ^ ^ 

ias been made of the num er o lt as s | 10 w,i in Table 

■ivity during a 20-.ni,mte penod v th resul s a ^ ^ 

J.1 for three monkeys each observed over P 

, i_ , T ;m« Eoch of Three Monkeys Changed His 
rABLE 6.1 The Number of Time Each of Five Periods 

Activity During a 20-minute Period, Recorded_ 


onkey 



A 

B 

C 


95 

77 

125 


72 

56 

13 


80 

0 

29 


66 

61 

107 


103 

101 

98 


Median 


80 

61 

98 


Range 


103 - 66 
101 - 0 
125 - 13 


37 

101 

112 


• number to characterize each monkey, 

convenient to have a sing • g t | le numbers for each 

id this might be obtained b> ^ ^ T , ms for 

onkey in ascending order-a 8 m The middle number, 

, the rearrangement is bb of activity for monkey A 

>, describes the characteristic patt ^ ^ ^ that fol . these 

id is called his median n “ m ^, , Jg and B the smallest, 01. 

ve periods C had the arges consistency of the monkeys 

/ e might also be interested n^t ^ ^ ^ or the difTerei.ee 

•om period to period, a. ^ 
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between their largest and smallest numbers. This range was only 
37 for monkey A, indicating that he did not fluctuate much from 
one period to another and so the median gives a fairly good idea 
of his score in any selected period. For B and C, however, the 
numbers differ widely from one period to another, the ranges be¬ 
ing, respectively, 101 and 112. If a sixth observation was to be 
made on each monkey, we could make a much better guess as to 
how A would come out than as to B or C. Thus each monkey 
has been characterized in terms of a typical score, the median, and 
the spread or variability of his scores, as indicated by the range 
of his scores. 

If the number of records had been even, say, six instead of five, 
how would we define the median? Let us suppose that A’s record 
in a sixth period was 69, so that his rearranged scores are 66-69-72- 
80-95-103. There is now no midscore, and any value between 72 
and 80 will have half the values smaller and half larger than itself. 
Actually, any number between 72 and 80 will satisfy the definition 
of a median, as given below. However, in such a case it is custo¬ 
mary to take an average and call £(72 4- 80) = 76 the median. 

The Median. The three sections of a class in elementary sta¬ 
tistical method, raw data for which are presented in Table VIII 
of the Appendix, have already received some attention. Table 3.4 
on page 25 shows in a general way that Section I had the most and 
Section III the fewest members making high scores in the midterm 
test and also in the arithmetic test. Now we shall make more 
explicit comparison of the three sections in terms of their midterm 
test scores. 

From Table 3.4, we note that exactly half of the combined 
group of 96 students had scores of 53 or more, and half had scores 
of less than 53. The lower limit of the interval in which 53 stands 
— that is, 52.5 — is, therefore, a point on the scale of scores having 
unique interest. It serves to describe this distribution by the fact 
that half the cases fall below and half fall above it. The score 
corresponding to this point is called the median , or the fiftieth per¬ 
centile. 

The median of a distribution is a value such that half of the scores 
in the distribution are larger and half are smaller than that value. 

The medians of the three sections cannot be obtained from 
Table 3.4. To obtain them we must have the frequency distribu¬ 
tion of each section, as shown in Table 6.2 without grouping, or 
in Table 6.3 with scores grouped in intervals of 5. We shall first 
compute the section medians from the ungrouped scores of Table 
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TABLE 6.2 


Distribution by Sections of Midterm Scores from Appendix 

Table VIII 

Ungrouped frequencies 


Number of cases 


Score 

71 
70 
69 
68 
67 
66 
65 
64 
63 
62 
61 
60 
59 
58 
57 
56 
55 
54 
53 
52 
61 
50 
49 


I 


1 


II 


1 


III 


Total 
2 


Number of cases 


1 


1 

2 

5 

4 

2 

5 

1 

1 

2 

3 

2 

2 


1 

3 

2 

2 

1 


1 

2 


1 

2 

4 


2 

1 

1 

1 

1 

4 


1 

3 

9 

8 

4 
9 

3 

3 

5 
8 

5 

6 


Score 


I 


II 


III 


Total 


48 

47 

46 

45 

44 

43 

42 

41 

40 

39 

38 

37 

36 

35 

34 

33 

32 

31 

30 

29 

28 

27 

26 

25 

Total 


4 

1 

1 

1 

1 


1 

1 

2 

2 


1 


1 


1 

3 


9 

2 

4 

3 

1 

2 

1 

3 


1 


41 


27 


28 


96 


6.2 and then make more general computations from the grouped 

,C °^JS eLine the distribution for Section f Ik Since 
N - 28 and \N = 14, the median » a score ^th.^14 ^ ^ 

are smaller and ^. larger ; ^'herc are 12 cases below the point 
distribution, we discover that th therefore, is between 

48.5 and 16 eases below 49.5. The me ^ ^ ^ x _ 4g as 

48.5 and 49.5. We may «“ but distributed evenly 

having scores not precise y ' g Qut yery ncat | y that 2 

throughout the score mterv . It abovc the midpoint, 

of h th h e 4 r 0 y Thrifo^poSo: l ^ 

r 2 “u^ ^w 8 it and 2 + 12 = 14 are above it. The 
median of Section III is 49. 
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For Section I, \ N = ^ = 20.5. Counting up from the lower 


end, we find that 20 cases have scores of 56 or less, and 25 have 
scores of 58 or less. The point we seek is in the interval from 57.5 
to 58.5, where there are 5 cases. If the 5 cases are evenly dis¬ 
tributed over that interval, we may picture them as in the sketch. 

There are 20 cases below the 
beginning point of this interval; 
we are looking for a point below 
which 20.5 cases fall.. Thus we 
want to divide the interval by a point such that 20.5 - 20.0 = 0.5 
case may be represented as below (that is to the left of) that 
point and the other 5 — 0.5 = 4.5 cases above — that is to the 
right. The point marked with an arrow in the adjacent sketch 
meets these requirements and its score value is readily seen to be 



57.6. 

In similar fashion, the median of Section II is computed as 
52.375, or 52.4. For a small group it is usually not important to 
carry the computation of the median to more than the first decimal 
place. 

Go back now to Table 6.2 and place an asterisk in each column 
to show the location of the median for the section to which the 
column belongs. Clearly the larger the median of a group, the 
higher is the general placement along the scale of scores. The 
medians show a difference between Sections I and II which was 
not brought out by their ranges, inasmuch as in both sections the 
lowest score was 30 and the highest 71. 

Computation of the Median from a Grouped Frequency Dis¬ 
tribution. In many situations the original data are no longer avail- 



Fig. 6-1 Histogram Illustrating the Position of the Median of the Midterm Scores of 
Section I. (Data from Table 6.3.) 
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We shall now carry out m (leta an cXerc ise for the reader, 

and leave those for the other sect ^ ^ wi „ maUe the general 

Reference to the histog looking for a point on the scale 

process clear. As be ore - are J ^ , listogr am, fre- 

of scores below which 20.5 cases wdl fa. ^ ^ ^ 54,5 are 
quency is represented by are , sought must fall some- 

represented by 18 are. unj^ T P ahlc 6 3 as 55-59. wh.ch >s 
where in the interval marked Thfa interV al -'eludes 9 

actually the interval from M- ^ ^ b e t0 the left of thejn echan. 

poops of which 21). o 1 .-- ~ 

cases, t-i-ptch we see I i 6.5 1 

From the adjacent skett 


- 

AB an( J therefore, 

AC" 9 ’ A 


— and, IN"'.”' 1 . 7 , 5 fl = ! c-59.5 

2 5 9 4C _ 59 5 - 54.5 = 5, or the width of the 

AB -*T l AO- S,,,CCA 2|; 5 1 = 1.4. Thus the value attach- 
interval, the length of A Bis g + j. 4 = 55.9. 

ing to the point B, wh ? ch y^clf^Uh that obtained previously 
This result is not precisely -> we | m ve here assumed that 

from the ungrouped ^’^5-59.5 are distributed evenly 
the 9 scores m the m e actually the case, 

throughout the interval and 

B ncy Distribution, by Sections, of Midterm 
Gr0UP6d ^Scores from Table 6.2 


TABLE 6.3 


Number of case* 


Score 
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TABLE 6.4 Procedure for Computing the Median 



General statement 

Find 50% of number of cases 

= __ 

Find the interval which con¬ 
tains the median 


Find the number of cases be¬ 
low the true lower limit of the 
interval located in Step 2 


Subtract the number obtained 
in Step 3 from the number ob¬ 
tained in Step 1 


Find the frequency in the in¬ 
terval which contains the 
median 


Find the ratio of the number in 
Step 4 to the number in Step 5 


Find the width of the step in¬ 
terval 


Compute the median as fol¬ 
lows: 

Lower limit of interval which 
contains the median plus the 
product of the ratio in Step 6 
by the width of interval in 
Step 7 


Application to Table 6.3 


i(41) = 20.5 


Counting to 20.5 cases either 
from the top or the bottom 
brings us into the interval with 
limits 54.5 to 59.5 


Number of cases below 54.5 is 18 


20.5 - 18 = 



2.5 


9 



2 5 

median = 54.5 + 5 = 55.9 


The procedure for computing a median may be schematically 
expressed as follows: 


lower limit 
., of interval 

1 ( ian which contains 

the median 


[ frequency below lower 
limit of interval which 

_ contains the median _ 

intervalJ frequency in interval 

which contains the median 


This procedure is displayed step by step in Table 6.4. It is an 
example of the general process of interpolation discussed in Chap¬ 
ter 14 of Walker, Mathematics Essential for Elementary Statistics. 

Try your skill on the other distributions, computing their 
medians from the grouped frequency distribution of Table 6.3. 
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The results should be as follows: 


Median from ungrouped frequencies 
Median from grouped frequencies 


57.6 

55.9 
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Total 

II 

III 

Group 

52.4 

49.0 

52.5 

52.9 

48.2 

53.1 

grouping 

has increased. 

• 


sometimes decrease^ - 

are called “(grouping erT0 ™ sec tions have now been eom- 

Other Percentiles. The three ag measure d by the 

tzs ’.srs t ,s —« — p " in “ 

other than the central values. ^ g distributio n are smaller 

Because 50 percent of fa called th e fiftieth percentile. 

than the median, the latter sco ute other percentiles. 

By selecting other percen is at the beginning. For 

The only difference m the p ^ percenti le, we start by 

ZTy'Z M™* ta —■»“■« * “““ “ °' 

lowed exactly. sym bol Xm to denote the 50th 

In this text we shaH use tn y ^ denote the gist per- 

percentile which is also the tw J ’ j symbo l t o represent 

centile, etc. There is no generally » and simple . 

a percentile, but these s> distr i bu tion is defined as a score such 
The 81st percentile of a d t have sma Uer scores. 

that 81 percent of the -cases t lead to a clear computa- 

I„ some situations ‘'''“^"'‘‘“'^.clified as follows: The 81st per¬ 
ilous .1 routine and so it M on the SC alc of scores such 

centile is the score con^P on * ng hi Jgram lies below an ordinate at 
that 81 percent of the ««“ J be adapted to any other per- 

thal point. These de n opr -, a te number for 81. 

centile by substituting the ap P ^ ation of several selected per- 
We shall now show the P nliles are to be computed 

centiles for Section I. When » t0 set up a cumulative 

for the same distribute , . Q r> To understand the fo 

frequency did,«»«»»■ ■* « ^".“ine T.Ue OX » Mi, 

lowing computations, > com putation. 

each number appearing 
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TABLE 6.5 Cumulative Frequencies for the Midterm Scores of Section 1, 

Taken from Table 6.3 


Score 

/ 

Cumulative 

/ 

Cumulative 

percent 

70-74 

1 

41 

100.0% 

65-69 

2 

40 

97.6 

60-64 

11 

38 

92.7 

55-59 

9 

27 

65.9 

50-54 

7 

18 

43.9 

45-49 

7 

11 

26.8 

40-44 

3 

4 

9.8 

35-39 


1 

2.4 

30-34 

1 

1 

2.4 


41 




What is the 25th percentile? 

25% of N = (0.25) (41) = 10.25 

A '. 26 - 44.5 + ( 10-2 y ~ 4 )(5) = 49.0 

JV .25 is also called the first quartile or the lower quartile and may 
be denoted Qi or Q L . 

What is the 75th percentile? 

75% of JV = (0.75) (41) = 30.75 

X .,6 = 59.5 + ( 3 °-^ — 2 - 7 )(5) = 61.2 

X. 7 S is also called the third quartile or the upper quartile and 
may be denoted @3 or Q u . 

What is the 12 th percentile? 

12% of 41 = 4.92 

A.i 2 = 44.5 + ( 4 ~ 92 ~ 4 0 )(5) = 45.2 


In cacii of the three illustrative computations note the fraction 
enclosed in parentheses. Its denominator is the frequency in the 
interval in which lies the particular score wc are seeking. (Verify 
this from Table 6.5.) Its numerator is the additional frequency 
we need in order that the frequency below the score point sought 
will be the correct proportion of X. Thus the fraction is the ratio 
of one frequency to another frequency and is a pure number. That 
ratio i< always a multiplier of the width of the step interval, which 
is 5 for the data wc* are using. The product of this ratio and the 
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,i,p interval gives us «« «.»« *«' h “ ° 

2™“Jong the seal. .1 >»». Thu, a. h.™: 

■J™ “f + (“>”) (" .1) ‘ 

sdditiuu .. the term, «. 5^'^ 

which you are now familiar, you are and j s0 are also 

The nine percentile value*J* X. w ^ deci , e s. All these 

called the first, second, third, .• • • “ ^ >u of them may be 

values belong to the same gen been ma de to drop the 

called quantiles. Many atte ™P b } groun d that centilc would 
first syllable of the term percentile on the g change would be 

be more consistent with decile an f J^c he^on ° Most 

a very sensible one, but som matter express approval 

people *h» !»•« P“* ‘T' 1't,g'r“” Nut u»U 

of the shorter term but go on using dec ,l e invented by 

the terms median quartile, per ^ ^ related terms 

Francis Galton before 1 , which have long since passed 

such as octile, dodecile, an per » generic term quantile 

out of the statistician's vocabulary. 1 B 

is of recent origin. Sma n Number of Scattered 

Computation of Pe « e “* ® s f cattered scores sometimes produce 
Scores. Gaps in a sma s * ose a teacher has obtained 

a puzzling situation. For exa l rrange d in order are as follows: 

scores for 20 students which when arranged 

She wishes to know X.* so she c0 ™ p “ t “ ^ PC ' scores . Obviously, 
looks for a score such that 15 case ^ ^ gomc 

this condition is met by any value , one w hich seems most 
arbitrary decision must be mad . ■£ of the wa y from 24.5 
reasonable is to select * value P manncr , X., s would be co.n- 

to 25.5, namely, 25.25. In J * bctwc en 9.5 and U.5, or as 

puted as lying 15 percent of the way 

9-8- , ■ „nnrtant to know something about 

Variability. It is often as n P mcaimre „f central position 

the variability of a group as to kr no) . any si „gl e percentile 

such as the median. Neither , scorC!i in a distribution scatter 

can give any clue as to whetl tl \ ^ Measures of van- 

widely or cluster closely about of scatter, of spread, 

ability are also called measures of dispersion, 

or of variation. 
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The following examples show the importance of studying vari¬ 
ability: The Chamber of Commerce of a community, which shall 
be nameless, advertised that the median annual temperature there 
was 70° (a true statement) and claimed that this implied a Bermuda- 
like climate. They failed to mention that sometimes the tempera¬ 
ture rose to 110° in summer and fell to - 20° in winter and that 
the number of days was small on which it actually hovered in the 
neighborhood of 70°. The median temperature would be consider¬ 
ably less important than the range for a person planning to move 
into the community. 

Two classes in college physics contain the same number of 
students and have the same median score on a prognostic test, 
but the range of scores in one is three times as great as the range in 
the other. The problems of teaching these classes will be very 
different. 

The people who live in village A and those who live in village 
B have almost the same median annual income. However, A is 
a very homogeneous community in which the difference between 
the highest income and the lowest income is small. In B, on the 
other hand, although there are a few middle income families, the 
population consists largely of two extreme groups, one composed 
of high salaried business and professional people and the other 
composed of domestic servants and unskilled workers. To describe 
these communities, it is clear that information about median in¬ 
come needs to be accompanied by some measure of income dis¬ 
persion. 

The Range. As a measure of the scattering, or dispersion, of 
the scores in a distribution, the range comes first to mind. The 
range is simply the difference between the highest score and the 
lowest. From Table 6.2, we see that the range for each section 
and for the total of all sections is 

For Section I, 71 - 30 * 41 
For Section II, 71 — 30 = 41 
For Section III, 62 - 25 = 37 
For all sections, 71 - 25 = 46 

The range is meaningful but notoriously unstable, subject to 
the accidental presence or absence of extreme cases in the group 
studied. Note for yourself how the range for Sections I or II 
would have been affected if the person scoring 71, or the person 
scoring 30, or both, had dropped out of the course or changed to 
another section. 
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quantiles and cumulative-percent curve 

T Ranee To obtain a measure of dispersio 

InterpercentUe Range. range> it is comm0 n practice 

which is more reliable than t symmetrically placed per- 

centiles, such as X. 9 o A io, _ v which gives the 

the middle 80 percent of the cases, ^ ^’ used of \ U such 
range of the middle 90 percent. Most widely 

TABLE 6.6 Selected Quantiles Computed from the Data of Table 6.4 


Value of the quantile 


Quantile 


I 


II 


III 


All sections 


X.„ 

X.90 

X.80 

X.7I 

X.70 

X« 

x. M 

X .40 

x.» 

X.24 

x.» 

X.lO 

X.04 

Range of 
middle 50% 
middle 60% 
middle 80% 
middle 90% 
entire group * 


66.9 
64.0 
62.1 
61.2 

60.3 
58.2 

55.9 

53.4 

50.4 
49.0 

47.5 

44.6 
41.2 


12.2 

14.6 
19.4 

25.7 
45.0 


64.2 

63.1 
60.8 

59.7 

58.1 

54.8 

52.9 

51.2 
49.6 
46.4 

43.9 

41.2 
39.8 


13.3 

16.9 

21.9 

24.4 
45.0 


62.2 

59.8 

57.4 

56.2 
55.0 

50.8 

48.2 

46.5 

44.8 
42.0 
39.0 

35.5 
28.0 


14.2 
18.4 

24.3 
34.2 
40.0 


64.2 

63.1 

60.7 
59.5 

58.2 

55.7 
53.1 

50.4 

47.7 

46.3 
44.9 

40.3 

35.5 


13.2 

15.8 

22.8 
28.7 
50.0 


. “ ° f ^ , third 

interpercentile ranges is the^“^ForSelaion I, 0. - Q' “ 
quar lies, called the ^erquarhle range. ot hcr sections 

61.2 - 49.0 = 12.2. I^r'and it istnteresting to see how similar 

will be found in Table 6.6, a ^ ^ mcasure 0 f variability, 
the three sections are m r g Curou )ative-percent Curve. In 

Relation of Quantiles with t)ie cumulative-frequency 

Chapter 5, we became acq’“ curve . Now we shall put the 

curve and the cumulat.ve-pe^ ^^^ ^ quantiles graphically 
latter to new uses, first lor u 

and later for comparing * and i ast columns of Table G..> ^ 

The relation between tf-^ gu<;h a grap h, in which the 

displayed graphically m 1 >g 



76 


PERCENTILES 



Fig. 6-2 Cumulative-percent Curve for the Midterm Scores of Section I. (Data from 
Table 6.5.) 


scale of scores is shown on one axis and the cumulative percent of 
the frequency below a given score is shown on the other axis, is 
called an ogive. The scale of scores may be placed on either axis. 
However, when two or more groups are to be shown together (as 
in Figure 0-3) and the scale of scores is placed on the horizontal 
axis, the group with higher scores is represented by the lower 
line; that effect tends to confuse a person not thoroughly familiar 
with graphs. Therefore, when showing such a graph to persons to 



Fig. 6—3 Cumulative-percent Curves for Midterm Scores for Three Sections. (Data 
from Table 6.3.) 








USE OF OGIVES FOR COMPARING GROUPS 
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whom this type of presentation is new - as perhaps to a school 
board or a parents’ organization - less explanation will be 
quired if the scores are placed on the vertical rather than t 

b^As'thfs graph was constructed, the scale of scores is on the ver¬ 
ticals; plotted points lie on horizontal lines through those score 
• + mark the boundaries between consecutive intenals. 

C« 7 «fpiT» £ 1 . pm ti,« -.•« i« 

, • t.Wp fi (i The subscript for one of the percentiles will 

indicate a point on the horizontal axis »d -o^hat per¬ 

centile wilUndicate a point on « ^ ^ represented b 

a similar fashion, plot the points X ^ X q ^ a ° In j rawi the 

plotted points should lie exac ^ th cumu lative percent cor- 
graph, points were located Zn g located 

responding to g iven ‘ nt jj es ) corresponding to given cumula- 

rssrsteSBnt-. «■>- «•—- 

of relation. determine graphically some of the other 

, N0 ”SX 0 lor ««"p" . 1« «pd a *'« »*— 

entries in Table 0.0 i o line from it to the curve, 

point on the base line, he gcale of score s at the point 

then a horizontal line w 1 ■ the point B represents 

A. The scale value of AisOn Vej of the middle 

Q»- Then thc f gm ( e ,ltile range Estimate the length of AB 
50 percent, or the mterq £ that estimate with the com¬ 

as accurately as possible and compare 

puted value shown in Ta > e • • The ogive for each of 

Use of Ogives for Comparing Groups. ^ U 

the three sections is shown ... 0-2. It will 

in which the ogive for ec ion three Sections found 

he noticed that ^ r^ in U.ii ^re on a vertical 

in any one row of labk . ulat i vc .pcrcent point. It will 

=r: -.- - 

immediately obvious. 75 Uiat t i, c median 

We have already seen in Table ™ ®j^ int „„ thc s( „,e of 

of the combined group ™ & horizontal line cutting each of the 

scores and in aw 
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three section curves. On the horizontal scale, read the values of 
the corresponding percents. These appear to warrant the state¬ 
ment, “The percent of cases with scores below the median of the 
combined group is 39 for Section I, 51 for Section II, and 65 for 
Section 111.” If computed from the ungrouped data of Table 
6.2, the exact percents are Jf- = 39 percent for Section I, £7 = 51.8 
percent for Section II, and = 64.3 percent for Section III. 


EXERCISE 6.1 

1. By computing from the data of Table 6.3, verify enough of the entries 
in Table 6.6 to assure yourself that you have no difficulty in com¬ 
puting percentiles. 

2. The combined group has a greater total range than any of the sections. 

(a) Could it possibly have a smaller range than one of the sections? 

(b) In terms of the other four measures of spread, does the total group 
appear to be more variable than any one of the three sections? 

3. Could the values for the total group be found exactly by adding the 
corresponding values for the sections and dividing by 3? Are the 
values for the total group exactly equal to the median of the three 
corresponding values for the sections? 

4. Which section appears least variable if you base your opinion on the 
total range? Which if you base it on the interquartile range? On the 
value of X .80 - X. 20 ? On the value of X. w - X. w ? On the value of 

X.9S — X.05? 

5. What is the 9th decile for the total group? Is the 9th decile a score 
or a portion of the frequency? Then, would you say that a particular 
person has a score above the 9th decile or in the 9th decile? 

6. In Section I, does the median lie nearer to Q 1 or to Qj, or is it exactly 
halfway between them? In Section II? In Section III? In the 
combined sections? 

7. In Figure 6-2, locate X M and X A0 on the vertical scale and mark the 
line segment which represents the range of the middle 80 percent of 
the frequency. 

8 . Make a graph for either Section II, or Section III, or the combined 
group, similar in plan to Figure 6-2. 

9. The medians of four groups are respectively: Group A, 38.6; Group 
B, 49.2; Group C, 24.7; Group D, 41.3. Arrange the four letters in 
the order in which the groups appear to stand on the scale of scores, 
placing first the group with highest standing. 

10 . The percents of individuals with scores below 85 in five groups are 
as follows: Group A, 23 percent; Group B, 36 percent; Group C, 18 
percent; Group D, 76 percent; Group E, 53 percent. Arrange the 
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five letters in the order in which the groups appear to stand on the 
scale of scores, placing first the group with highest standing. 

11. If 28 percent of Group A have scores below the median of Group B, 
which group appears to stand higher on the scale of scores? 

12. In the sketch below, one tenth of the aria of the histogram lies to 
* the left of the line BN and one tenth to theright of the line GI. The 



D E 


w » • 

line EK divides the area in half. One fourth of the area is\ he , cft 
of CAf and one fourth to the right of FJ. The point i y,iddle 
°„ f f Z interval having the greatest fre.uenc^ Answer ea, 

f0 "7 ing "Lin J 

y'ou can name it by 

You may assume that the a, 

point of the scale is at 0. 

What represents: 

|b) The mode a (= midpoint of interval having greatest frequency) 

(c) Qi 

(d) Qi 

(e) The interquartile range 

(f) The middle half of the cases 

(g) The upper 10 percent of the cases 

(h) The 10th percentile 

13. From!the"sketch on page 80, answer the following questions: 

(a) On which axis are percentile values meas “ rc “ . ? Qf B? 

(b) What name would you give to the score of pomt A^Of 

(c) How many different nameslean, you th ent BD? 

(d) What does the segment AL rep < «■ • nt 0 G? 

(e) What does the segment I'll represent. 
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•,AS among Individuals within a Group. Suppose a 
Company pQsition ig to be filled and the committee respon- 

very imp ° r appo intment has narrowed its choice to five persons 
sdde for has different strengths and weaknesses. The com- 
caC l 4s itself unable to give scores to the candidates but after 
nuttce Oration places the names i n a n order of preference. In 
long ^ rc j s ^ ran j cs the candidates. 

lG ere are many such situations in which it is not feasible to 
urc subjects but very feasible to rank them. For this purpose 
™ not necessary to know by how much one subject exceeds 
other but only to know that he does exceed him. There are 
oher situations in which, although it is possible to obtain scaled 
measures, it is preferable to use ranks. 1 It is common practice 
to assign rank one to the subject standing highest in the char¬ 
acteristic under consideration and to assign succeeding ranks in 
descending order. That practice will be followed in this text. 
However, the reader should be warned that several recent writers 
on mathematical statistics reverse the order, using one for the 
lowest rank and that this practice was followed by Lincoln Moses, 
who wrote Chapter IS of Walker and Lev’s Statistical Inference. 

Changing Scores to Ranks. To transform scores to ranks pre¬ 
sents no difficult}' unless there are ties — that is, two or more 
individuals with the same score. For example, suppose we have 
six individuals with scores as indicated: 


Subject: A B C D E F 

Score: 12 9 8 11 9 10 


1 Most of the reasons why ranks would he used in preference to sealed scores cannot 
lu* clearly explained at tlin stage. The reader is referred to Chapters 10 and 18 of \\ alker 
and Lev, Statistical //i/m/n e, and the sources quoted there. 
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We assign rank 1 to A, because he has the highest score; rank 2 
to D, rank 3 to F. Now B and E are tied and must receive the 
same rank, but, if we give them both rank 4 and then give C rank 
5 the group of individuals will have no one ranking sixth. If it 
had been possible to distinguish between B and E, one of .hem 
would have had rank 4 and one rank 5. Therefore it is proper 
to give each of them rank 4(4 + 5) = 4.5, and to give the individual 

,0 "£E tSXui «. ranked, .he aum o, .her rank, i, .1- 


ways 

( 6 . 1 ) 


N(N + 1) 

2 


, tu- hp used as one check on the correctness of the 

ranks^which^hTvJbeen assigned. Thus, for 6 individuals, the sum 

of the ranks should be 52! = 21. Add the ranks indicated below 
to see that their sum is 21. 


Subject: 

Score: 

Rank 


A 

12 

1 


B 

9 

4* 


C 

8 

6 


D 

11 

2 


E 

9 

4 * 


F 

10 

3 


EXERCISE 6.2 

For each of these sets of scores, change the scores to ranks and check 
the sum of the ranks by formula (6.1): 

1. Subject: A B C D E F G 


Score: 

2. Subject: 
Score: 

3. Subject: 
Score: 

4. Subject: 
Score: 


25 21 18 24 20 21 21 

ABCDEFGH 


G 

A 

12 

A 

5 


9 13 
B C 


8 

B 

4 


7 

C 

7 


0 14 

D E 
10 0 
D E 
9 8 


8 12 5 

F G H 
9 7 


7 

F 

5 


G II 

3 


5 


1 

2 

I 

13 
I 

9 12 


J K L 

8 14 5 

J K L M N 

8 2 13 6 


Ambiguities in Ranks. Suppose your neighbor tells you that 

his i£h£ written home that he stands tenth in his Freshman 

class in college; should you congratulate or commiserate The 
class in college depends on the size and the nature of 

meaning of this nnk o!W P compared . The boy may be 

the group wit of BCVera i hundred students or near the 

b e oUom of a P very small class. The college in wtohheta ^nro.'cd 
may have very strict or very loose admission requirements 
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A measure of relative position must always derive part of its 
meaning from the nature of the group with which the individual is 
compared. A child of 10 may feel very tall and very mature in a 
group of younger children but, if transferred to & group in which 
he is the youngest, he may feel very small and inexperienced. He 
has not changed, but the standard of comparison has changed. 
A measure of an individual’s position relative to his group has 
obvious meaning for understanding persons and group behavior. 
Especially in the field of education and psychological testing are 
scores completely without meaning unless they can be referred to 
some standard, some bench mark. What does it mean to say 
merely that a student scored 29 on a test? Nothing at all. This 
may be a phenomenally high score or a phenomenally low one, or 
it may be the average for students of comparable age and training. 

Now consider the ambiguity in ranks due to size of the group. 
Suppose John, whose height is 52 inches, ranks third as to height in 
a group of boys. Can we express his position in terms of the 
percent of boys who are shorter than he? 

If there are 10 boys in the group, one might at first thought say 
that 7 boys, or 70 percent, are shorter while 2 boys, or 20 percent, 
are taller. This statement accounts for only 70 percent + 20 
percent = 90 percent of the group, John himself constituting the 
other 10 percent. 

If there are 50 boys in the group and John is third, one might, 
on the same basis, say that 47 boys, or 94 percent, of the group are 
shorter and 2 boys, or 4 percent, are taller. This statement ac¬ 
counts for 94 percent + 4 percent = 98 percent of the group, John 
himself constituting the other 2 percent. 

These two statements are much more meaningful than the 
mere statement that John has rank 3. The statements can be 
further improved if we think of histograms for the two distribu¬ 
tions of ranked scores. In the histogram for 10 scores, John’s 
rank is at the 75th percentile of the distribution. In the histogram 
for 50 scores, John’s score is at the 95th percentile. We can re¬ 
phrase these statements conveniently by saying that John's score 
has a pcrccnlilc rank of 75 in the first distribution and a percentile 
rank of 95 in the second. 

Percentile Score and Percentile Rank. If John is 52 inches tali 
and his height has third rank among 10 boys, the following state¬ 
ments may he made: 

(a) 75 percent of the group have height less than John. 

(b) 75 percent of the group have height less than 52 inches. 



83 

determining percentile rank 

(c) The 75th percentile of height for this group is o2 inches. 

(d) John’s height is the 75th percentile. 

(e) John has percentile rank 75 in height. 

(f) The height 52 inches has percentile rank 75. 

Study these statements carefully noting that they all express the 
same relationship but with different emphases, just as the two sen¬ 
tences “John is Mary’s brother” and “Mary is John s sister ex¬ 
press the same relationship with different emphases Note that 
statements (e) and (d) employ the term percentile while statements 
(e) and (f) employ the new term percentile rank. Compare each 
of these with statements (a) and (b), noting that 52 inches, w i 
is a score, is called the percentile, while 75 percent, which is a 
relative frequency, furnishes the percentile rank. 

EXERCISE 6.3 

Give two translations of each of these statements, by filling blanks in 
the two sentences which follow the statement. 

1. Exactly 57 percent of the teachers in a given school system receive 

salaries less than $5840. . 

_ is the-percentile of the distribution of salaries. 

-is the percentile rank of • 

2 . Of the deaths recorded from a given disease for a given year in a given 

community, 87 percent were of persons under 12 years old 

is thJ-percentile of the age of persons dying from this disease. 

is the percentile rank of age 


-- 13 tltA. - w # i ,1 *70 n 

3. Exactly one half of the scores in a given distribution are less than 7 . . 

The-percentile is-• 

The percentile rank of is • 

4. Exactly 5 percent of all persons consulted had annual incomes larger 
than $0850. 

The-percentile is-. 

The percentile rank of-is • 

Determining Percentile Rank from a Frequency Distribution 
and from a Graph. What is the percentile rank m Section I of a 
student who scored 47? Since wc have already available the osive 
of Figure 0-2, the easiest way to answer tins question is to read the 
required percentile rank there. We locate 47 in the scale of scores 
Which is the vertical scale, and draw a horizontal line through that 
point to the curve. Then on the horizontal scale wc read the corre¬ 
sponding cumulative percent. That cumulative percent ,s the 
percentile rank. In Section I the percentile rank of 47 is approxi¬ 
mately 18. Note that in Figure 6-2 percentile s (which are scores) 
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are measured along the vertical scale while percentile ranks (which 
are cumulative percents) are measured along the horizontal scale. 

The same result could, with a little more work, be obtained 
from Figure 6-1 or Table 6.3 if the ogive were not already drawn. 
Note the position of score point 47 on the scale of scores in the 
interval 45-49. We need to know what proportion of the fre¬ 
quency (that is, the area of the histogram) would lie to the left 
of an ordinate at this point. Either from the histogram or the 
table we can see that the number of cases below this point is 1 + 
3 + part of the 7 cases in the interval 45-49. To decide how many 
of the 7 cases in the interval 45-49 should be considered as below 
47, we may examine the adjacent sketch. If x represents the num- 



A B C 

44.5 45.5 46.5 47.5 48.5 49.5 


her of cases in the given interval below the point B, then 

x AB x 47 -44.5 _ * /2.5\ _ 0 c 

7 ~ AC' ° r 7 " 49.5 - 44.5 d 7 \ 5 / 3 ' 5 


The number of cases below the point 47 is, therefore, 


1 + , + (¥) 7 -7' S 

and the percent of cases below that point is 


7.5 

41 


= 18.3 percent. 


The score 47, therefore, has percentile rank of 18 in Section I. 

Another way to think of this computation is to note in Table 
6.3 that 4 cases fall below the score point 44.5 and 11 below 49.5, 
and to interpolate to find the missing number: 


Score 

point 

49.5 
47 

44.5 


Number of cases 
below score point 

11 

? 

4 


What would the percentile rank of score 47 be in Section II? 


1+5 + 




and = 25.9 percent 
£ / 


Its percentile rank would be 26. 
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What would its percentile rank be in Section III? 

2 + 4 + 2 + (^)8 = 12 


— = 42.9 percent 
28 

Its percentile rank would be 43. 


linear values is a pare number such tha., wne ^ rf ^ fre _ 

the frequency in the mterva , g interested, 

quency falling below the score point in which 


TABLE 6.7 


Procedure for Computing the Percentile Rank of a 

Score 


Given 


Stop 

General statement 

Application to score 47 for 

Section I in Table 6.3 

1 

Find the interval in which the 
uiven score is located _ 

Score 47 is in tha interval with 
true limits 44.5 to 49.5 

2 

From the given score, subtract 
the lower limit of the interval in 
which it is located 

47 - 44.5 = 2.5 

3 

Find the width of the interval 

i = 49.5 - 44.5 = 5.0 

4 

Find the ratio of the number m 
Step 2 to the number in Step 3 

2J> 

5.0 

5 

Find the number of cases below 
the true lower limit of the inter¬ 
val located in Step 1 

1+3 = 4 

1 6 

Find the frequency in the inter¬ 
val located in Step 1 

/- 7 

7 

Compute the following: 

Value of Step 5 plus the prod¬ 
uct of the ratio in Step 4 by 
the frequency in Step 6 

^+ (H) (7 > ■ 7 - 5 
—-- 

1 8 

Compute the percentile rank as 
the ratio of the value obtained 
in Step 7 to the total number of 
cases, expressed as a percent 

— = 18.3% 

41 
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Frequency sum of fscore minus beginning - ! Tfrequency"] 

below _ frequencies , point of interval in 

ordinate in all lower [ width of interval J L intervalj 

at score pomt intervals u 

The procedure is set out step by step in Table 6.7. 

EXERCISE 6.4 

The following tabulation shows the percentile rank, for each section and 
for the combined group, of four scores. Verify these by computation based 
on Table 6.3 and by reference to Figure 6-3. 


Score 

Percentile Rank 

I II III 

Total group 

41 

5 

9 

23 

11 

54 

42 

56 

67 

53 

62 

79 

85 

95 

85 

66 

94 

96 

100 

96 



Mean and Standard 
Deviation 


hapter 6 was devoted to tb~ « X^^a 

istribution can bej charac derivedjr ^ more distributions can be 

leasure of variability, { ) nosition of individuals within 

ompared; and (c) how e re , ter will deal with the 

distribution can be t the measures used 

ame three problems, the measures known as quantiles, 

„ Chapter 6 belong to the family of measures whereas 

dl of Which are based on some aspect o and 

n this chapter we shall use the me In Chapter 8, 

£ r "“pared and criteria suggested 

f Suppose five undernourished children are placed 

„„ TJSS « - *»** *i—““— 

measured in pounds, as follows: 

John, 8; Sam, 2; Max, 3; Bob, 5; George, 12 

• u* Kp rpnorted ? By the methods of the 
What average gain might be r p imm J iate i y see n to be 5 lb 

previous chapter, the me i g people are also familiar 

and this is one type of .«W- g°^ ding the SCO res of 

these 5 boys is 30 and so the mean gain is g 6. 

Symbols. To reduce the let 

shall employ a few convemc: y, ^ gum of „ is customarily 

STS* S letter sigma, Z. comparable to the 
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capital S in our alphabet. Then 2X means “the sum of all meas¬ 
ures represented by the letter X” N will be used throughout this 
text to indicate the number of individuals under consideration. 
The mean will be denoted X and this symbol can be read either 
“mean of X” or “X bar” or “bar X Note how vivid the defini¬ 
tion of a mean becomes when translated into these symbols: 


The mean can be 

obtained by 



finding 
the sum of 
all the scores 
2X 


(7.1) 


or better 



2X 

N 


and 

dividing 
it by 


the number 
of scores 

N 


This formula constitutes a definition of the mean. 


EXERCISE 7.1 

Find the mean and the median of each of these sets of scores: 

1. 3, 2, 7, 12, 9, G, 4, 5, 7, 2 

2. 19, 12, 18, 8, 23, 13, 19 

3. 415, 402, 401, 405, 409, 410, 416, 412, 414, 406 


A Measure of Variability. When the median is used as a meas¬ 
ure of central position, variability can be measured by some in¬ 
terpercentile range such as Q 3 - Qi or X. 9S - X. 05 . However, when 
central position is measured by the mean, variability should be 
measured by a statistic based on deviations from the mean. 

For each of the 5 boys mentioned at the beginning of this chap¬ 
ter, we might find the deviation of his score from the mean score, 


tions: 


G, X - 

X = X 

' - 6 and perform the following 

Boy 

X 

X - X 

| X - T | 

(A r - X) 2 

John 

8 

9 

2 

4 

Sam 

2 

-4 

4 

16 

Max 

3 

-3 

3 

9 

Bob 

5 

-1 

1 

1 

George 

12 

6 

6 

36 


30 

0 

16 

66 


I he sum of the deviations from the mean is always identically 
zero no matter how great or how small the variability, 

(7.2) 2(X-X) = 0 
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A MEASURE OF VARIABILITY 

S= £^“ d 

r t- rk&zr ssrJriS 

—*“■ t ~ “ i “‘ °' b "“"” E 

as absolute values. The customary sj , = 3 . For 

signed value of a number a is | a I* 0 f the absolute 

the 5 individuals under consideration the sum ^ ^ ^ 

values of the deviations from the m 6 kn0 wn as the 

unsigned deviations is 16/5 - * Am!> 
mean deviation from the mean. 

V | X - X 

( 7 . 3 ) Mean deviation from the mean = -jj 

i ovpmiTp the individuals in a group 

It tells us how much on the average the ^ the mean 

JrSKlTuS fo ;v::; 

Z ^e^reUmes drawn in industrial plants. 

^ VC The'othe/mrthod * 0 /removing minus'signs is to square eaeh 
deviation and take the not only 

variance. 


(7.4) 


Variance = 


Z(X^ X )_ 2 

iV - 1 


2SgiiH=== 
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but in most cases requiring less arithmetic work is 


(7.5) 


Variance = 


N2X 2 - (2X) 2 
N(N - 1) 


For our data, 2X 2 = 2 2 + 3 2 + 5 2 + 8 2 + 12 2 = 246 


Hence, the variance = 


5(246) - 30 2 
5(4) 


330 

20 


16.5 as before. 


Graphic Representation. The variance is a measure of spread 
for which no graphic representation can be made. Do not try to 
form a visual image of it. Its square root, however, can be repre¬ 
sented as a distance measured along the scale of scores. That 
square root is called the standard deviation. Later in this chapter, 
it will be used as a standard for measuring the deviations of in¬ 
dividuals from the group mean, a use which appears to be responsi¬ 
ble for its name. At present, however, we are considering its use 
as a measure of the variability of a group. The symbol 1 we shall 
use for the standard deviation is s. 


(7.6) 



V variance = 



In Figure 7-1 each of the five scores we have been discussing 
is represented by a point on the scale of scores. Each score has a 
frequency of 1, and these frequencies are represented by the shaded 
areas. The mean is at the point 6. The standard deviation is the 
line segment 4.1 scale units long, which is marked s. 

The standard deviation and the variance will be used in most 
of the statistical problems you are likely to find of interest. The 



_X gain ' n weight in lb. 


Fig. 7-1 


Gain in Weight in Pounds Made by Five Boys. 



1 When the term standard deviation was coined by Karl Pearson in 1S94, he used 
the small Greek letter sigma (cr) and defined it as 



Today, some writers use .V and some use N — 1 in the denominator. The use of N 
gives a biased estimate and the use of V — 1 an unbiased estimate of the variance in 
the population from which the sample was drawn. Today many writers prefer, so far 
as possible, to use English letters as symbols for statistics computed from a sample and 
the corresponding Greek letters for theoretical rather than observed values. Following 
this practice we shall use s to denote a standard deviation computed from observations 
but in the chapter on the normal curve we shall use cr for the standard deviation of that 
theoretical distribution. In Chapters 13 to lfi, both symbols will be used. 


COMPUTATION WHEN THERE ARE DUPLICATE SCORES 91 

feeling of strangeness which you ^ entertain in irl 

will soon wear off as you meet them. m variety.^ 

The To^t'yiut^ienTe wTth the standard deviation, but be- 

- this statistic in an actual siUiation you need to 

he able to comp ^ * “ 
Computation When T tbere had been 25 and that 

suppose that instead o Table 7 L Here, there are 3 

esz zzz:s — - 

l k nntn) 



250657) - 195* = ^ = 5.67 
Variance - - 25(24) 600 


/X s 


144 

363 

400 

81 

320 

98 

144 

75 

32 

1657 


^ 5^67 


i i t/4 Hisolav (iiitu but to illustrate 

. Note that the purpose ta “e's ^ usc(i which would he 

i—£■£ sa.’SSSi.—■ 

resultant sum may be ap pr °pnate|y Jabel^ed Z/X- + ^ = 3(121) 
manner, the 3 scores of 11 mu ^ C Ush t his is to set up a final 

- 363 to ZX ! . An easy way t° “ccomp in this co l U mn by 

column headed /X ! , and to ° ’ 1a "V, , its corresponding X. 

multiplying an entry m the /X comm 
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Thus for X = 11, we have 11(33) = 363 which is, of course, equal 
to 3(11 2 ). 

Then for a distribution having duplication of scores, it is 
appropriate to insert an f in formulas (7.1), and (7.5), making 
them read 


(7.7) 

(7.8) 



Variance = 


2/X 

N 

JV2/X 2 - (2 fxy 
N(N - 1) 


Even when the / is not written in the formula, it is always under¬ 
stood. 


EXERCISE 7.2 

1 . Make a graph of the histogram for the distribution of Table 7.1. On 
the scale of scores mark the position of X = 7.8, X — s = 5.4, X — 2s 
= 3.0, X + s = 10.2, X + 2s = 12.6. Draw a line segment of length 
s = 2.4. 

2. Compute the percentile rank of score point X + s = 10.2 

3. Compute the percentile rank of score point X — s = 5.4 

4. Compute X. M . For these data it happens that X and X. M are identical. 

5. For these data, the range is equal to how many standard deviations? 

6 . Compute the median, mean, variance, and standard deviation of each 
of the following sets of scores: 

(a) 7, 15, 10, 9, 4, 3, 7, 10, 10, 8, 16 

(b) 24, 27, 22, 28, 26 

(c) 1, 2, 5, 6, 10, 11, 12, 14, 15, 15 

(d) 1,9,0, 1,5, 1,0, 11 

(e) 2, -3, 7, -9, +8 

(f) 9, -7, 0, -4, -0, 3, -9 

7. For each of the distributions in the preceding question, what is the 
ratio of the range to the standard deviation? 

8. From the values of X and s obtained in questions 6(d), 6(e), and 6(f), 
would you say it is possible for s to be larger than X? 


Computation from an Arbitrary Origin. Suppose it is neces¬ 
sary to find the mean of the 5 numbers, 27309, 27321, 27305, 
27290, and 27298. No doubt many people would compute 2X = 

136530 and X = —^— = 27306. However, some of 3 r ou who are 


more inventive may say to yourselves, “Now, obviously, the mean 
cannot be very far from 27300. . I will just add together the 
amounts by which the various numbers differ from 27300 to see 
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Li must b. 30/5 « 8. S. ■ » 2, ”“ + 6 - 

273 nL 1 st u. genumlim this pro,,.., j. 

m“.*: -r„ 

origin and designated as A. Then, 


The mean is equal 

to 


the 

arbitrary 

origin 


increased 

by 


the mean of 
the deviations 
from that origin 

2(X -A)_ 

N 


(7.9) 


x.,+52^ 


to be 27306 we have 


X 

27309 
27321 
27305 
27299 
27296 


X _X (X-X ) 2 


3 

15 

-1 

-7 

-10 

0 


9 

225 

1 

49 

100 

384 


. 3 Bi _ qn a nd s = V96 = 9.8. If A 

By formula (7.4 , vana- =^ id % an tQ usc for . 

had not been an integer, , v v iqa vtO XX 2 = 

mula (7-4) and we might have computed ZA = 130,530, SA 

3,728,088,564. Then by formula (7.5) 

5(3.728,088,564) - (136,530^ = I** 2 ? = 9( . 

variance =-5(4) 20 

, f rn This procedure is correct but unnecessarily laborious 
To^slmrten UicTprocess we shall again take an arbitrary origin at 
the convenient point .4 = 27300. Then 

g^:^I 3 9 ? + 2P + 5 2 + (- ir+ (- 4)= - 564 
5(564) - 30 2 = 1920 __ 99 as | )e f 0 re. 

and variance = ‘ 5 ( 4 ) 20 
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The procedure is symbolized by formula 

N2(X - A) 2 - f2(X - A )] 2 
(7.10) variance -- N(N-T) - 

and this can be seen to differ from formula (7.5) only in that the 
X of (7.5) has become (X — A) of (7.10). 

The original values (here denoted X) measured from an origin 
at zero are called raw scores or gross scores. When each score is 
reduced by some arbitrary value, the results (here denoted X — A) 
are called deviations from an origin at A. When gross scores are 
large, computation by such gross score formulas as (7.1), (7.5), 
( 7 . 7 ), and ( 7 . 8 ) may be very laborious by hand computation though 
quite convenient if a machine is available. An arbitrary origin 
can be used to reduce the size of the numbers involved and thus 
to reduce the labor of computation. 

To assure yourself that the choice of A is really arbitrary, com¬ 
pute the mean and variance of the numbers 23, 29, 21, 35, 31, and 
36, using first one origin and than another, as for example: 


A 

2(X - A) 

Z(X - AY 

X 


Variance 


0 

175 

5293 

175 

6 

29.17 

6(5293) - (175) 2 

30 

1133 

30 

20 

55 

693 

20+ f = 

29.17 

6(693) - (55Y 

30 

1133 

30 

30 

-5 

193 

3° —§ = 

29.17 

6(193) - (5) 2 

30 

1133 

30 

32 

-17 

237 


29.17 

6(237) - (17) 2 

30 

1133 

30 

25.5 

22 

2G9.5 

25.5 + = 

29.17 

6(269.5) - (22) 2 

30 

1133 

30 


The outcome will be exactly the same no matter what value 
is given to A, so the computer has free choice. However, in order 
to keep down the labor of computation it is wise to let A be an 
integer (20 or 26 would be better than 25.5), which is easy to sub¬ 
tract (20 would lie better than 19 or 26), and which is small enough 
so that all deviations are positive (20 would be a better choice 
than 26). 

Computation When Scores Are Grouped in Intervals. Until we 
could take up the situation in which scores are grouped in intervals, 
it has seemed wise to restrict attention to distributions with small 
.V and narrow range. Mastering the type of problem discussed 
in this section, however, will make it possible to work with any 
distribution. 
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COMPUTATION FROM GROUPED SCORES 

Even though you may be heartily tired of the midterm scores 
of the three sections of a statistics class, it will be instructive to go 
back to them again in order to see the relation of mean and stand¬ 
ard deviation to the measures computed in Chapter 6. 


TABLE 7.2 Three Computations of the Mean and Standard Deviation for 
the Midterm Scores of Section I Taken from Table 6.3 



70-74 

65-69 

60-64 

55-59 

50-54 

45-49 

40-44 

35-39 

30-34 


1 
2 
11 
9 
7 
7 
3 


Sum 41 


Mean 


Variance 


Stand¬ 
ard de¬ 
viation 


X 

fX 

/X* 

72 

72 

5184 

67 

134 

8978 

62 

682 

42284 

57 

513 

29241 

52 

364 

18928 

47 

329 

15463 

42 

126 

5292 

37 



32 

32 

1024 


2252 

126394 


2252 _ 

54.93 


41 



X—32 /(X-32) /(X—32) 1 


40 

35 

30 

25 

20 

15 

10 

5 

0 


40 

70 

330 

225 

140 

105 

30 

0 


1600 

2450 

9900 

5625 

2800 

1575 

300 

0 


940 


24250 


41(126394) — (2252)* 


41(40) 
110650 


1640 


= 67.47 


V67.47 = 8.21 


940 c rv? 
32 + —- = 54.93 
41 


41(24250) -(940V 
41(40) 

_ = 67.47 

1640 


V67A7 =8.21 


8 

7 

6 

5 

4 

3 

2 

1 

0 


8 

14 

66 

45 

28 

21 

6 

0 


64 

98 

396 

225 

112 

63 

12 

0 


188 


970 


32 + 




93 


25 


.^ 41(970)-088) 


41(40) 


nc /4426\ „ 

= 25 UtoJ = 0 '- 


! ) 


47 


V 67.47 = 8.21 


In Table 7.2 three computations have been made from the same 
set of scores. Computation I makes use of gross scores treating 
each score in the distribution as if it were situated at the mid¬ 
point of its interval, and using formulas (7.7) and (7.8) to complete 
the computation. Do not feel obliged to carry out the "J^metic, 
but examine the table to make sure you know what has been 
done. The numbers are large enough to seem rather formida > 
in hand computation and you will doubtless prefer Methods II 

and Method II is exactly like Method I except that an arbitrary 
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origin has been taken at the midpoint of the interval 30-34, and 
so each value of X has been reduced by 32, thus making the num¬ 
bers considerably smaller. Note that the final values of mean and 

variance are identical with those of Method I. 

Method III makes use of what are called coded scores. Note 
that each value in column x' is exactly one fifth as large as the 
corresponding value in column X — 32 of Method II, and so Sx 
is one fifth as large as 2(X - 32) and must be multiplied by 5 to 
give the same result in computing X. Now 5 is the step interval, 
and we shall designate the width of the interval by i. Then, 


(7.11) = X - A 

An x ' is called a coded score. Such coded scores are obtained by 
writing 0 in the interval where the arbitrary origin is placed and 
labeling the intervals above that origin successively 1, 2, 3 . . . 
and the intervals below it -1, -2, -3 . . .. Thus the values 
of x' provide a new scale in which each step is one interval wide. 
To change this scale back to the scale of scores, we must multiply 
by i or * 2 as indicated in formulas (7.12) and (7.13). 

(7.12) X=A+z(^) 


(7.13) 


variance = 


, ( NWx'Y - 

l \ N(N - 1) ) 


Note that the only difference between formula (7.13) and the 
earlier (7.8) is that x has taken the place of X and the whole 
has been multiplied by the square of the step interval. Note 
that the arbitrary origin is always taken at the midpoint of an 
interval. 

In Method II, each value of X has been reduced by 32 but is 
still measured in score units. Therefore, .4 = 32 and % — 1. 

Other Formulas Related to the Variance. The formulas given 
here for the variance, (7.5), (7.8), (7.10), and (7.13) have been 
selected out of many algebraically equivalent formulas because 
they present an economical computing routine. The student who 
reads widely in other books will probabl}' discover some of the 
following formulas which are set down here only to reassure him 
that they are algebraically consistent with those presented in this 
chapter. 
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CHECKING 

RESULTS 


(7.14) 

2x 2 = 2(X - X) 2 = 

(7.15) 

1 —J 

CS 
• c- 

11 

e* 

H 

w 

Z/(x') 2 " - 

m 

(7.16) 

2 v, W 
N 

_0 _— 

8 = 

N - 1 


CO 

to 

II 

ro 

2 fix'Y - 

(7.17) 

N - 


ax) 

N 


ajx 


N 


Tj 


Wx') 


As formulas for the standard deviation are obtained from those 
for the variance merely by taking the square root, they need no 

bB ^ ^'previously, many texts use N where this uses N - 1 

aS CheS Result^ Retracing the steps of a computation sel- 
, • l, u. Chock methods are needed to give a con. 

JX^Sdencet the results of his work, especially if he is com- 

puting^wdthout^access to^a machm, ^ ^ 

origin recomputing Zx' and Z(x') 2 , and comparing these: ruth 
values under the first computation provides an easy check. 

Raising the arbitrary origin by r intervals 

n i«\ changes 'Zx' to Zx — Ar 

(7.Si and changes 2 (x') 2 to S(x') 2 - 2r2x' + Nr 2 

Lowering the origin by r intervals 

(1 20) changes 2x' to 2z' + A r 

(IS) and changes 2(x') 2 to 2(x') 2 + 2r2x + AV 

Now let us apply these principles to the computation by 

Method III in Table 7.2, where i = 5 and the origin was taken at 
Method interval in which any frequency occurs. 

;X ««own w— »k T - «' S 

“ “i p„iU«. II ». moo. it op, ™»< ».lo« o! , » « 
be negative which you will find is a nuisance when a computing 

In tin, columns W ~l WJ*- 
the values vou would obtain if you made a specified shut (r 
the value of A. Verify these from the data and then ven y 
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they agree with the values in the check columns: 


A r 

Xfx' 

from data 

Check 

W) 2 

from data Check 

37 

1 

147 

= 188 - 41(1) 

635 = 970-2(1)(188)+41(1) 

47 

3 

65 

= 188-41(3) 

211 = 970-2(3)(188)+41(9) 

22 -2 

270 

= 188 + 41(2) 

1886 = 970 + 2(2) (188) + 41(4) 


Relation of Standard Deviation to Range. Sometimes a large 
error — such, for example, as may result from misplacing a decimal 
point — can be detected by comparing the computed standard 
deviation with the range. The ratio of the range to s is almost 
never smaller than 2 or larger than 6 ; in fact, it is not often smaller 
than 2.5 or larger than 5.5. For example, the range estimated 
from the data of Table 7.2 is 74 — 30 = 44 and s = 8.21, so the ratio 
of the range to the standard deviation is 44/8.2 = 5.4. Suppose the 
decimal point had been misplaced in computin g the variance, so 
the standard deviation was reported not as V67.47 = 8.2 but as 
V6.747 = 2 . 6 . The presence of some serious error could have been 

surmised because rang€? = 44 = 17, which is much too large to be 

s 2 .o 

reasonable. If the decimal point ha d been misplaced in the other 
direction so that s appeared to be V674.7 = 26, the ratio of range 
to s would be || = 1.7 which is small enough to rouse suspicion. 
The comparison of the standard deviation to the range cannot 
prove the correctness of the computation of the standard deviation 
but it can sometimes reveal an error. 

Comparison of Groups. The mean and the standard deviation 
have been developed to this point as summary measures useful in 
characterizing a distribution. The mean achieves this by identify¬ 
ing a point on the scale, usually near the center of the distribution; 
the standard deviation describes the spread of the distribution. 
One can also speak of this mean and standard deviation as char¬ 
acterizing the group on which the scores making up the distribution 
were computed. Thus we may speak of the mean IQ of a class 
being 100 and its standard deviation being 10 . 

The mean and the standard deviation can also be used to com¬ 
pare groups. Suppose, for example, that the mean IQ of one 
eighth-grade class is 103 and standard deviation 4.1, while the 
mean IQ of a second eighth-grade class of about the same size is 
102 and standard deviation is 9.3. The means are approximately 
the same, but the spread of the second class is so much greater 
it will present different and more difficult teaching problems. 
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Both variance and standard deviation are expressed m the 
units of the original distribution. The standard deviation o 
the weights of a group of 12-year-old children from underpr.v. eged 
homes might be compared with the standard deymUon of 
weights of a group from privileged homes because both standard 
delations are in terms cf weight units and the means are not very 
disparate. One could not, in the same way, ask whether a group 
of children are more variable in weight than in height because 
one standard deviation is expressed in weight units and one in 
height units, and there is no basis for comparison. Nor would it 
be reasonable to say that adults are more variable in weight than 
babies merely because the standard deviation of adult weights 
larger than that of infant weights. The means of the two groups 
would be so diverse that one feels inclined to inquire how large 
is the variability relative to the average. T he ratio of the standa 

deviation to the mean, * . is called the coefficient of variation. 

Caution. At this stage there is a great temptation for the be¬ 
ginning student to run ahead of his information, lie has, let us 
say, found that a group of 150 girls had a higher mean score and 
a larger variability than a group of 120 boys on a test of ap¬ 
preciation. That is an observation, true for these particular cases. 
But the research worker is tempted to go further and to say, not 
that “ these girls had a higher mean than these boys and were more 
variable” but that “girls have a higher mean and are more vari¬ 
able than boys,” making a generalization from the particular cases 
observed to boys and girls at large. Such a generalization is know n 
as a statistical inference, being an inference from an observed 
sample to the unknown population from winch that sample 
drawn. The more extended analysis on which such mfei cnees 
must be based will be introduced in Chapter 13 of this text. For 
the present, the student would be well advised not to generahzc 
on his data, and to report his conclusions m the past tense so that 
his readers will understand that they apply specifically to the 

groups studied. 


EXERCISE 7.3 

1 The basic data for computing the mean, variance and standard devia- 
tion for'the midterm scores of each section and the combined sections 
are given on page 100. Complete the computations and write your 

results on the blank lines. 



100 MEAN AND STANDARD DEVIATION 

I II III Combined 

group 

From the ungrouped data 
of Table 6.2 on page 67 

2/X 2257 1425 1349 5031 

2/X 2 126,979 77,291 67,569 271,839 

X - - - - 

2 f(X - 50) 207 75 -51 231 

Zf(X - 50) 2 3779 2291 2669 8739 

X 

s 2 

s 

From grouped scores 
of Table 6.3 with .4 = 52 
and t = 5 

2 fx' 

Zfix'y 

x 

s 2 

s 

2. You will notice that there is very close agreement between the results 
obtained from the grouped and the ungrouped data. Why is the agree¬ 
ment not perfect? 

3. Do the three sections stand in the same order with respect to the mean 
as they do with respect to the median? 

4. For each Section find the value of X — Xm 

Section I - Does there seem to be 

“ II - fairly close agreement 

“ III - between the mean and the 

Combined group- median? Are they identical? 

5. In terms of the standard deviation, place the three groups in order of 
variability, from most variable to least variable. Do the same in terms 
of the interquartile range. Is the order the same? 

6 . Compute the ratio of the range to the standard deviation for each 
section and for the combined group, working (a) with the ungrouped 
data and (b) with the grouped data. 

7. Suppose you were reviewing the work of someone else and you found 
the statement that 2A' = 920, N = 30, and X = 3.06, it would be 
clear that a mistake had been made because these three numbers are 
inconsistent. If you found the statment that X = 23.4 and = 16.9, 
you would not be able to say that the results were correct but you would 


24 4 -22 6 

122 82 120 324 
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certainly have no evidence that they were not. For each of the follow- 
. y nf J ata ; n w hich there is an inconsistency indicating that a 

“^to e^lyt probably been made, write M on the blank 

line If there is no evidence of inconsistency and the data are quite 

possibly correct, write P on the line. 

- (a) X.m - 32; X.»-61; X.« - 63 

_ (c) N = 30; if A = 6, 2/ = 23; if A = 5, 2x' = 18 

_ d) JV = 37; X=12; 2(X-12) = 26 

_ e) JV = 60; X = 7; 2X = 420 T = „ 

_ (f) The extreme scores in a distribution are 3 an , ’ 

_ (g) The ^extreme scores in a distribution are 5 and 36; X = 29; 

- (h) V= 7 60; 2(X-8) = 33; 2(X-9) = -27 

_ (i) N = 80; 2(X - 12) = -15; 2(X - 11) - 

r\ m - 100 - T = 31* 2(X — 31) 2 = 502 
— 00 N : 100; X = 31; 2(X - 30* = 502; 2(X - 35)* = 421 

_ (1) AT = 30; 2(X - 25) = 120; X = 25 + W - 29 

(m) X.u = 52; X. w = 60; X.n = 73 


Mean and Standard Deviation of Combined Groj from Those 

xszstJXrfXz Pz ~ 

to indicate the number of subgroups. 

( 7 . 22 ) N e = Ni + Ni + • • • + 

(7.23) 7 C = ^ (W.r. + N*7, + ■■■ + Nj>) 

n oat A = 7 .- 7 , or in words, d, represents the amount by 
(7 ' 24) w 'hTch the mean of the subgroup differs from the mean 

of the combined group 

(7.25) (N, - 1)«! = (Af. - «■» + <*• - ' - • + (W ‘ ' 1)S ‘ 

^ + iV |(% + N 2 <% + - • - + Nkd k 
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TABLE 7.3 Routine for Obtaining X c and s c for a Composite Group 


Subgroup 

N 

X 

s 

AX 

d 

(N - 1) s 1 

NcP 

I 

41 

55.05 

8.27 

2257 

2.64 

2736 

286 

II 

27 

52.78 

8.95 

1425 

.37 

2083 

4 

III 

28 

48.18 

9.77 

1349 - 

4.23 

2576 

501 


96 



5031 


7395 

791 



x c = 

5031 

9G 

52.406 






95 si . 

= 7395 + 

791 = 8186 






Se '■ 

/8186 
V 95 

= V86.17 = 

9.28 




The use of these formulas will now be illustrated by application 
to the data of Table 6.3 on page 69. There, of course, the original 
scores are available, and direct c ompu tation from them produces 
the values X c = 52.40 and s c = VS6.14 = 9.2S. We shall now ap¬ 
ply the formulas just as if all the information at hand were what is 
provided in Table 7.3 in the columns headed N, X, and s. 

The sum of the column N gives us 96 as the value of N e accord¬ 
ing to formula (7.22). 

The sum of the column A r X divided by 96 gives us X c = 52.41 
according to formula (7.23). 

The values in column d are obtained by subtracting X c from 
each value of X. 

The values in columns (N — l)s 2 and Nd 2 are filled in. The 
sums of these columns are added to provide the value on the right 
of the equality sign in formula (7.25). This number is S186. 

8186 

The computation of s c is completed by finding s? = ■ = 

86.17 and s c = \ / S(U7 = 9.2S. ° 


By direct computation from raw data, we had obtained X 0 = 
52.41 and s r = 9.28 and by application of the formulas we have 
the same values. 

No similar procedure is possible for obtaining percentiles or 
percentile ranks of a combined group without recourse to the 
frequency distribution. 


Standard Scores. In Table VIII we notice that the student 
with code number 3 has scores 38, 38, and 39, respectively, on 
the three tests of reading, artificial language, and arithmetic. 
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P this be interpreted to mean that his standing in the group is 
C , 1 . a. three tes ts? By the methods of the previous 

" h T,rthuVd 0 ^tTt P hat in'reaSt r ha1 62® 

,h “ ‘ Sipu.. . »" ”*"”"Cu h “I.‘t‘"«S 

zsi - 

distribution of original scores the group 

Comparing the scores o studen^Nu ^ 2jj ^ aboye 

means shown mi Table 7.4 ^ mcan artificia i language; 

the mean in reading, . ; But can these differences be con- 

and 7.6 above, in arithmetic. liut were not equally 

“S siSbee 0 ,: 

S^rSe^S^ion Of that test, producing the 
values ' These valucs might be Calle<i standaTd SC ° rCS ' 


Test 


Reading 

Artificial language 
Arithmetic 


No. 

3 of Appendix 

Table VIII 


A a 

Score of 

X 

A a - x 

5 

x* 10^ 

A’j - X 

5 

student ■ 

3 





38 

38 

30 

35.7 

40.4 

31.4 

2.3 
- 8.1 
7.0 

7.7 

11.0 

7.3 

.30 
- .70 

1.04 

53 

42 

00 


) 


+ 50 


. 1 r nf overlooking a decimal point, it has 

To avoid the dangt multiply such standardized 

generally been found c0 " vc oug h to give a result from 

deviations by eon*; numbti• la t ^ final ^ T1 „. moat 

which we feel justified P is jo. To eliminate 

convenient number to use for this pmp 
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the negative signs, we add some number large enough to make 
all results positive. The number 50 is ordinarily, but not neces¬ 
sarily, used for this purpose. The transformed scores thus pro¬ 
duced are also standard scores and to avoid confusion we shall 
refer to them as transformed standard scores. The transformed 
standard scores for student Number 3 in the final column of Table 
7.4 agree with his percentile ranks in indicating that his standing 
was highest in arithmetic and lowest in artificial language. 

When scores on several tests are to be averaged and it is desired 
to give them all the same weight in the average, they should be 
put in standard form before taking the average. If raw scores 
are averaged, the test with the largest standard deviation will 
exercise the greatest weight in the composite. 

In the previous example, the three standard deviations (7.7, 
11 .0, and 7.3) were not very diverse, and so the advantage of 
transforming scores to standard form before averaging — if scores 
were to be averaged — is not very great. Suppose, however, that 
a teacher wants to average scores on three class tests in which 
the means and standard deviations are 

r, = 9 s, = 3 

A 2 = 41 Sz = 7 

X 3 = 63 s 3 = 2 

Consider the four students listed in Table 7.5 and note that, if 
raw scores are averaged, the students all seem to have the same 
standing but that, if scores are transformed to standard form before 
being averaged, it is clear that student D excels the others by a 
considerable amount. The individual scores and the standard 
deviations here were chosen deliberately to produce an effect 
somewhat more striking that you are likely to encounter in 
practice. 


TABLE 7.5 Raw Scores and Standard Scores for Four Students on Three 

Tests (Artificial Data) 


Test 

A 

5 

Haw score 1 

lor stuilent 

Standard score 

for student 

A 

a 

C 

I) 

A 

■n 

C 

l) 

I 

11 

3 

If. 

7 

•} 

15 

70 

43 

27 

70 

II 

11 

i 

11 

50 

57 

33 

50 

03 

73 

39 

111 

«;:* 

•) 

r, i 

on 

5S 

till 

40 

35 

25 

SO 

Sum 



117 

NT 

117 

117 

100 

111 

125 

i so 

Mean 



30 

30 


30 

53 

47 

42 

63 
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EXERCISE 7.4 

, For ^ 

indicated, and then change these numbers to percents. 

Number of Percent of 
students students 


Range in symbol form Range in scores 


Below X - s 
Between X — s and X 
Between X and X + s 
Above X + s 
Below X - 2s 
Between X — 2s and X 
Between X and X + 2s 
Above X + 2s 


Below 43.2 

Between 43.2 and 52.5 
Between 52.5 and 61.8 
Above 01.8 
Below 33.9 

Between 33.9 and 52.5 
Between 52.5 and 81.1 
Above 81.1 


Above a -r . . 

2 - : ' a " ied 
To standard scores and average those standard scores. 

Mark 
for Jones 

89 
27 
38 
72 

3. Below are given John’s Wghest in relation to the 

;“up 0f for h lch 8 X andVwere computed? On which lowest? 

Test 

English usage 
Algebra 
U.S. History 


Test 

A 
B 
C 
D 


80.3 

25.4 
37.3 
65.2 


5.6 

8.4 

12.1 

4.3 


M ark 
for Smith 

82 

24 

46 

68 


Mark 
for Brown 

75 

28 

40 

63 


X 

s 

73.2 

12.6 

58.6 

4.2 

41.3 

6.8 


»■•*->- 

On which lowest? 

John’s score 

72 
61 
50 



The Frequency Distribu¬ 
tion — A Summary 


In a sense most of the material presented up to this point has 
dealt with the frequency distribution — tabulation, graphing, 
scaled variables, measures of central position, measures of vari¬ 
ability. Certainly all of Chapters 5, 6, and 7, and considerable 
portions of 3 and 4 dealt with aspects of the frequency distribution. 
Nor will the rest of the book turn away from this extremely im¬ 
portant topic. Before taking up problems of correlation and re¬ 
gression which involve two variables, it will be wise to summarize, 
amplify, and consolidate what has been said about the frequency 
distribution of a single variable. The discussion in the present 
chapter relates chiefly to the distribution of a scaled variable. 

Summary Measures. In the two preceding chapters we have 
learned to describe frequency distributions by certain summary 
measures such as the mean, median, various percentiles, the differ¬ 
ence between two percentiles, and the standard deviation. Other 
additional summary measures will be described in this chapter. 

With so many measures at hand, the reader may well wonder 
what selection of them he should make for adequate description 
of a frequency distribution. Some guides will be presented in the 
following paragraphs. It may be pointed out at once that, if a 
distribution is fairly symmetrical (a matter which is discussed 
later in this chapter) and has its largest frequencies near the middle 
of the distribution, nearly as much information can be conveyed 
by giving a measure of central position and a measure of vari¬ 
ability as by presenting the entire distribution. The question still 
remains as to which measure of central position and which measure 
of variability are to be chosen. 

Measures of Position. The mean, median, and the percentiles 
all refer to points or positions on the scale of the frequency dis¬ 
tribution, so that they may all be called measures of position or 
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location. The mean and median in particular commonly occupy 
a central position in the distribution so that they are called meas¬ 
ures of central position. These two measures are also called aver¬ 
ages because they convey the meaning usually associated with the 

concept of the average of a group of numbers. 

Another summary measure, which is usually associated with 
the mean and median as both a measure of central position and 
as an average, is the mode. It should be noted, however, that the 
mode does not necessarily have to be either central or even unique, 
and that it can be computed when there is no scale. It is that value 
which occurs most frequently in a distribution, just as the style 
of dress or hat or coat which occurs most frequently in a given 
season is called the mode. In a grouped frequency distribution, 
the mode is considered to be at the midpoint of the interval in 
which the largest frequency occurs. If two or more adjacent in¬ 
tervals have the same frequency, and it is larger than the fre¬ 
quency in other intervals, the mode is considered to be at the mid¬ 
point of the entire range covered by such intervals. In a smooth 
frequency curve, the mode is the abscissa corresponding to the 
highest point on the curve. Two kinds of mode should be dis¬ 
tinguished. The mathematical mode is a concept of considerable 
interest in theoretical work and is far more useful than the crude 
mode However, the mathematical mode is difficult to obtain; 
it can in fact, be obtained only if the equation of a theoretical 
frequency curve is known, so that its maximum point can be 
found. This is almost always a task calling for some mathematical 
erudition. Therefore, the mathematical mode is not discussed 
in elementary treatises, and the term “mode” is ordinarily applied 

to what is more properly called the crude mode. 

The crude mode is the measure described iu the first three 
sentences of the preceding paragraph. It is extremely easy to 
obtain, requiring no computation but merely inspection of the 

dlS Two other averages may be mentioned in passing. Problems 
concerning time, rate, and distance - or time, rate, and work - 
often require the use of what is called the harmonic mean. I rob- 
lems that relate to a variable which is changing over a period of 
time at a fairly constant rate (not constant amount often re¬ 
quire the use of the geometric mean. Rate of growth of an oigan- 
ism or a population is such a variable, further description of 
these means or presentation of their formulas is outside the scope 
of this book. These means are seldom needed, but when they arc 



108 


THE FREQUENCY DISTRIBUTION 


needed there is no alternative. Any person who has a problem in 
averaging rates would do well to consult a text in which the 
geometric and harmonic means are discussed. 

The Choice of a Measure of Central Position. Let us begin 
by looking at certain situations in which there really is no choice 
because not all three measures — mean, median, and mode — can 
be computed: 

1. If the variable is expressed in unordered categories only, 
the mode is the only average which can be found. Suppose a 
panel of judges is deciding which of four exhibits shall receive a 
prize and they vote as follows: 

Exhibit A B C D 

Number of votes: 2 13 1 4 

It is quite correct to say that B is the mode of this distribution. 
There is no mean and no median. 

2. If the variable is not scaled but consists of a set of ordered 
categories, there is no mean but a rough median can usually be 
found, and a mode. 

3. If the variable is scaled but has one or both ends open, 
there is a mean but it cannot be computed. The median and the 
mode can usually be obtained. An example of an open-ended 
distribution would be one in which the highest interval is desig¬ 
nated as “75 years and over,” or “more than $10,000,” or the 
like; or the lowest as “less than 5 years,” or the like. 

Certain qualities which are desirable in an average are possessed 
by both mean and median and so do not furnish criteria for choice 
between them: 

1. An average should be easy to comprehend and interpret. 
The mean, median, and mode meet this requirement. 

2 . It is desirable that an average be easy to compute. None 
of the three is difficult to compute. However, it might be noted 
that to obtain either the mode or the median it is necessary first to 
organize the scores into a frequency distribution (which is easy 
enough to do on an IBM sorter even if the number of cases is very 
large), but this step is not necessary for obtaining the mean. 

3. It is desirable that the average should not be greatly affected 
by the process of grouping scores into intervals. Such grouping 
has very little effect on either mean or median unless the number 
of cases is quite small. However, it often affects the mode greatly. 
For an illustration, look at Table 6.2 on page 67. Here the fre¬ 
quencies are so scattered that the modes are not clearly defined. 
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Group I has two modes at 58 and 62, while the distribution for 
the combined sections has three modes at 47, 58, and 0- \\ hen 
the frequencies are grouped in intervals of 5, the position of the 
mode changes. In fact, as different beginning points are chosen 
for the interval, the mode shifts. To illustrate the mstability' of 
the mode as scores are grouped into intervals, the modes for the 
data of Table 6.2 under various conditions of grouping are s own 

here: 


I 

58, 62 
63 

59 

60 
61 
62 


II 

52 

63 

54 

60 

51 

52 


III 

47, 49 

48 

49 

50, 60 
51 
47 


All 

sections 

47, 58, 62 
48 

59 

60 
51 
62 


Ungrouped frequencies 
Highest interval 66-70 
Highest interval 67-71 
Highest interval 68-72 
Highest interval 69-73 
Highest interval 70-74 

4 It is desirable that the average fall in the middle of the 
,. * n Xhe m ean and median meet this condition but the 
distribution. end of the distribution where 

itcannot serve satisfactorily as representative or ty pe of that 
diSt 5 ow having disposed of the less weighty issues we come to 

i Lend the mean varies less from one sample to another than 
In general, tle mea eat advant age to the mean. 

the median does, and this 8 8 of dra wing 20 small 

Table 13.2 on page 216 presentre {or each both 

samples from the same pop “ “ , at this tablc will reveal that 

£ ZZ Sid lei variability from sample to sample than did 

ii * .n. 

ends instead of in the mi , esence 0 f SU ch extreme scores, 

to sample than the me i . ^ p thjj distribution> suggests the 

e prssibmty C that C in another sample a few very high scores might 
occur without compensating ^LdTii ' such a way 

2 . It is helpful to r b e v a a formula '-sod on all the scores in 

liSo 6 X her Z mode nor the median possesses this 
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characteristic. Both of them depend only on scores in the interval 
in which they fall. Consider for example the eleven values in 
row A. If the values below 15 are made much smaller, as in row 
B, the mean is lowered but neither the median nor the mode 
reflects that change. If the scores above 15 are made larger, the 
mean is increased but not the median or the mode. 



X 

X.to 

Mode 

A. 9, 10, 13, 14, 15, 15, 15, 16, 18, 19, 21 

15 

15 

15 

B. 2, 3, 6, 8, 15, 15, 15, 16, 17, 17, 18 

12 

15 

15 

C. 9, 10, 13, 14, 15, 15, 15, 19, 26, 35, 38 

19 

15 

15 


There are at least two distinct advantages which the mean 
possesses because it is an algebraic function of all the scores in 
the distribution. The first advantage is that it has been possible 
to develop a great deal more mathematical theory about the mean 
than about either of the other two averages. A second advantage 
is seen when several separate groups are to be combined into one 
single group. To obtain either the median or the mode of the 
combined group, it is necessary to construct its frequency distribu¬ 
tion. The mean of the combined group can be obtained from the 
means and the A r ’s of the component groups. 

3. When extreme scores occur at one end of a frequency dis¬ 
tribution, careful thinking about the meaning of the data is nec¬ 
essary in order not to use a measure of central position which will 
misrepresent. Consider Table 8.1. Here the mean, median, and 
mode for the two districts and for the combined group are as 
follows: 



District 

District 

Both 


A 

B 

districts 

Mean 

256.5 

240.8 

248.6 

Median 

201.0 

201.6 

201.4 

Mode 

175.0 

175 

175 


Quoting only the mean would give a fantastically wrong impres¬ 
sion, for in each group about three fourths of the families had 
incomes lower than the mean income. A few atypical families with 
relatively high incomes have affected the mean unduly. For these 
data both median and mode are of genuine interest and should 
be reported. 

If a teacher wishes to compare the average of her class on a 
standardized test with published norms for the grade, it is better 
to have both norm and average expressed as medians because 
one or two atypical scores might influence the mean too greatly. 


Ill 

VARIABILITY 

TABLE 8.1 Number of Families Having a Given Dollar Income in Two 

Magisterial Districts 

(AdaDted from Mary B. WiUeford, Income and Health in RemoU 
M Arena Teachem CoUege, Columbia Universrty, Bureau of Pub- 

li cations) 


Number of families 


Income in 
dollars * 


Both 

districts 



Each entry 


represents the midpoint of an interval 50 units wide. 


5n Terms of Its Variability. Several 
Describing a Dl ^ U1 ° bilit have bee n described - the total 
different measures of ^ability na ^ ^ standar(1 de _ 

range of scores, some in erp genera l the standard 

viation. Each has its spec a usehulness^l ^ ^ measureg ^ 

deviation and the m ^belOercentiie range and the mcdia n belong 
are used together, w i t h either average, 

together; the total range y how the total range is in- 

Look at Ta ' H I case so that it grossly exag- 

flated by the presence of one exirc 
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gerates the spread of the whole. Examine the figures for various 
measures of variability on which the data of Table 8.1 are based. 


Measure computed 
from original 
ungrouped scores 


District 

A 


Total range 2959—0 =2959 

Range of middle 96% 754—34 = 720 
Interquartile range 260-157= 103 
Standard deviation 215 


District 

B 


1311-0 =1311 
890-50 = 840 
242-164= 78 
181 


Both 

districts 


2959-0 =2959 
870-43 = 827 
247-161= 86 

199 


When even 2 percent of the families are eliminated at each end 
of the distribution, the range for the combined districts drops from 
2959 to 827, thus giving a fairer representation of the scattering 
in the group. 

The relation between various measures of dispersion for four 
distributions of different form is seen in Figure 8-1. The dotted 



Fig. 8-1 Position of Decile Points in Frequency Distributions of Different Forms. 

Dotted lines are erected at the decile points. 

Brace extends from Q/. to Or with center at median. 

Large arrow indicates mean. 

Small arrows indicate points at distance of one standard deviation on either side of mean. 

ordinates are erected at the decile points. The brace below each 
figure extends from A r . 2s to X lb with its cusp at the median. 
Tlie mean is indicated by a large V, the points one standard de¬ 
viation from t lie mean by smaller v’s. 

Now take a card or piece of crisp paper and mark on its edge a 
distance equal to the standard deviation of Figure .4. Lay this 
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distance off along the base line to estimate the length of the range 
in standard deviation units. Do the same for each of the four 
figures. You will find that the ratio of the range to s is about 5* 
for A, about 4} for B, a little less than 3 for C, and about 3 5 fo 
D In general, you are unlikely to encounter a distribution in 
which the range is more than 6s or less than 3s. It is impossible 
for it to be less than 2s. In Figure A, the interquartile range s 
about |s; in B, about |s; in C, slightly more than 2s; in D 
slightly less than 2s. Obviously, the relation of these measures of 

variability depends on the form of the distribution. 

Describing a Distribution in Terms of Symmetry or Skewness 
If the graph of a distribution can be folded along an ordinate at 
the median in such a way that the two halves of the figure coincide 
nerfectlv, that distribution is called symmetrical. In such a dis¬ 
tribution the mean and median are identical. If there is a_smg e 
mode, it is also identical with the mean and median. Of course, 
absolute symmetry is practically never 

but the concept provides a convenient method of descnbmg . 

tributions in a general sort of way. . . , 

When X - Xm is positive, a distribution is said to be positively 

nkpwe.d or to have positive skewness. 

When X - X M is negative, a distribution is said to be nega¬ 
tively skewed or to have negative skewness. , , f . „ 

The terms “skewed to the right” and “skewed to the left 

are sometimes met but they are ambiguous and should be avoided. 
Wri““ who use these terms are not agreed on their meanuig, some 
apply in gone term and some the other to the same type of dis- 

tri vSous statistics are available for measuring skewness, but 
are not used often enough to justify describing them here. 

Describing a Distribution in Terms of Its Peakedness Figure 
8 2 shows six histograms all of which arc symmetrical yet the six 

are obviously dissimilar. Their 

peakedness ’ R i . i m g _3 \ s called mesokurtic. A 

distribution which goes up to a high, thin peak and has long tails, 
Se A in Figure 8-2, is called leptokurtic. A squatty distribution 
like D in Figure 8-2, or one which sinks down m the middle, like 
B and F is called plutykurlic. These terms are of no practical 
fm^ortun’ce and need not be memorized. They are offered hero 

for your amusement. 



114 


THE FREQUENCY DISTRIBUTION 


9 



Fig. 8-2 Symmetrical Histograms Illustrating Different Degrees of Kurtosis. 


Measures of kurtosis exist but are not needed often enough to 
warrant describing them here. Suffice it to say that the figure 
cv 4 printed below the sketches in Figure S-2 is such a measure; 
and that a 4 - 3 is zero for a mesokurtic or moderately arched dis¬ 
tribution; «4 - 3 is positive for a leptokurtic distribution and 
negative for a platykurtic distribution. 

Figure S-3 shows five distributions which differ both as to 
symmetry and as to skewness. The figure a 3 below them is a 
measure of skewness such that a- 3 is positive for a distribution 
which is positively skewed; is negative for one which is negatively 
skewed; and is zero for one which is symmetrical. 

U-shaped, I-shapcd, J-shaped Distributions. These terms are 
vivid but not explicit. A distribution like F in Figure 8-2 or C in 



«,= 1.40 a }- 1-10 ct — 1.10 ar j= —.45 

£>•<-3 = 1.58 £**-3=.Gl a 4 -3=.61 a 4 -3 = -.43 

Fig. 8—3 Histograms Illustrating Different Degrees of Skewness and Kurtosis. 
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BIMODAL DISTRIBUTIONS 

Figure 8-1, with a mode at each end of the distribution, is called 

TT *hnnpd because it resembles the letter U. 

‘ A distribution which has a mode in the middle, no matter 
whether it is symmetrical or not, has some resemblance to a manu¬ 
script “ i” and is called I-shaped. Sketches 1 and B in Figu.e 8-1, 
sketches A, B, and C in Figure 8-2 might be called I-shaped 

A skewed distribution with mode at one end only at either 
end is termed J-shaped. Examples are B, C, and D in Figure 

^Bimodal Distributions. Distributions having two pronounced 
modes are unusual and, perhaps for that very reason interesting. 

M«”* S <h, ,<». U, which . I* hjM "» c 

skillfully than his left, Durost 1 computed for each of 1300 chil 
dren a handedness ratio which is positive for right-hande pels >• 

predominance of right-handed children. 

350 r 


Fig. 8-4 Bimodal Frequency Distribution- Handed¬ 
ness Ratios. (From Durost. p. 302 op. cif.) 
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* 1 1 1 Handedness Ratio 


Table 8, shows a " IT tZ'SZS 
tnbution in measures a 15 pcrio d. Relatively few 

fajsh^ItdTod^ cloudiness, most of the days were recorded 
as either very clear or very cloudy. 

, i „♦ r.f u battery of objective group testa of manual 

* 1:100 —■ " 

Monographs , 16 (October, lJil)# 
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TABLE 8.2 Degrees of Cloudiness as Observed at Greenwich during the 
Years 1890-1904 (Excluding 1901) for the Month of July* 


Degree of 
cloudiness 

Frequency 

10 

676 

9 

148 

8 

90 

7 

65 

6 

55 

5 

45 

4 

45 

3 

68 

2 

74 

1 

129 

0 

320 

Total 

1715 


* From Gertrude E. Pears?, "On corrections for the moment-coefficients of 
frequency distributions when there are infinite ordinates at one or both of the 
terminals of the range.” Biomctrika, 20A (1928), 33G. 

The Normal Curve. The “normal” curve is a theoretical dis¬ 
tribution of great importance which will be treated more fully in 
Chapter 12. It is a smooth curve, not a histogram; it is sym¬ 
metrical, and moderately peaked. It has an unlimited range. 
Mean, median, and mode are identical. The semi-interquartile 
range is about two thirds of the standard deviation. In this par¬ 
ticular curve, the standard deviation is the abscissa of the point 
of inflection, as illustrated in Figure 8-5. 



Fig. 8—5 Standard, Deviation in a Normal Curve. 


The Effect of Increasing Every Score in a Distribution by the 
Same Amount or of Multiplying Every Score by the Same Amount. 

On the following page is a list of nine scores and certain statistics 
computed from them. 
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Scores 

Mean Median 

Mode 

Range 

5 = 

s 

X 

X +4 
2X 

3X 

1 2 3 4 7 0 9 9 10 

5 6 7 8 11 13 13 13 14 

2 4 6 8 14 18 18 18 20 

3 6 9 12 21 27 27 27 30 

G 7 

10 11 

12 14 

18 21 

9 

13 

18 

27 

10-1 =9 

14 — 5=9 
20-2 = 18 
30-3=27 

12.25 

12.25 

49 

110.25 

3.5 

3.5 

i 

10.5 


In the second line, X + 4, each of the original scores lias been 
increased by 4. Examine the statistics computed for this row, 
compare them with the corresponding statistics in the first row, 
and formulate a general statement about the effect of adding a 
constant amount to every score in a distribution or subtracting 

a constant amount from every score. 

In the third line, 2X, each of the original scores has been dou¬ 
bled and in the fourth line, 3X, tripled. Examine the statistics 
for these lines, compare them with the corresponding statistics in 
the first line, and formulate a general statement about the effect 
of multiplying (or dividing) every score in a distribution by the 


Increasing (or decreasing) every score in a distribution by a given 
number will have no effect upon any measure of vanability (range 
inlerpcrccntile range, variance, or standard deviation) but u l in¬ 
crease (or decrease) by that number every measure of position (mean, 

TYlfdittfl TTlodc. OT dill) pCTCCfltllc) • . , 

Multiplying (or dividing) every score in a distribution by a given 
number will have the effect of multiplying (or dinding) by that number 
e ery measure of variability except the variance and every measure of 
pZZ. The variance will be multiplied by the sguare of the given 

"“"Geometric Representation of Various Statistics of a Frequency 

Distribution Visualizing the position of a statistic on a histogi an 
Distribution, visuam & v find it worth while to ex- 

« cm. ltcmcm"cr .h,C U, .1 

which are scores, are found on this horizontal scale. The nmdum 
v is almost always found between the mean, A, and the mode. 

,hc p..W.n Ot .h« Him v.l~ 

divides the area into two equa pai s.^statistic. The vertical 

base line. d ° f e \"“ t ZerZ with the largest frequencies 

'ZS&SZZ m^ on the base line, but the length of that 

vertical line is not the mode. segment which 

. A measure of ^ 

is a portion of the horizontal axi.. 
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Fig. 8—6 Geometric Representation of Various Statistics. (Shaded area is percentile 
rank of Score C.) 


the standard deviation s and the interquartile range. The segment 
AB represents the range. 

Percentile rank of a score is represented by area to the left of 
an ordinate at the point on the base line which represents that 
score. Thus the shaded area represents the percentile rank of 
score C. 

In Figure 8-G, X lies to the left of A'. 50 , indicating that the 
mean is smaller than the median, or X - A’. 5 o is negative. There¬ 
fore, this distribution is negatively skewed. 









The law of Regression tells heavily against the full hereditary 

transmiln of any gift. Only a few out of many eh drenwould 

1,0 likely to differ as widely as the more exceptional of the two 1 ar 

cuts The more bountifully the Parent is gifted by nature, the 

more rare will be his good fortune if he begets a son who is as 

richly endowed as himself, and still more so if he has a son who is 
^ i x « ir>v.roiv Hut the law is even-handed, it levies 
endowed S^m^UnS X- ^ of badness as of good- 

an equa the extravagant hopes of a gifted parent 

Ihlt his children wdll inherit all his powers; it no less discounte¬ 
nances extravagant fears that they will inherit all Ins weakness and 

sszri.'»«««■ u»-' »■» 


, e .x mnxt elementary scientific as well 

iS a TaUffo tTo^s"! score of an individual on one variable 
practical effort to u.e score of the same in- 

a means of determining, . ’ tl determination is 

vidual on another var,ab e of {rom its cost per 

tact, as in finding t e ' • , ion is inexact, as in estimating 

ard. At other times the d ^ & coursc in college mathematics 

of his score in a college entrance test in the 
ame subject. , 0 Variable from Another. 
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of one provides information concerning the other, the variables 
are said to be related , or associated, and a relationship is said to 
exist between them. In such a matter as estimating a student’s 
class performance from his score on a prognostic test, the test 
score comes first in point of time and so is said to be used to predict 
performance in the course. From such literal usage, the term 
prediction has spread to other situations in which, without any 
implication of futurity, a score on one variable is used to furnish 
information about another variable. Thus one may try to “pre¬ 
dict” a child’s weight from his height, or his mental age from his 
score on a test of paragraph meaning. 

The purpose of this chapter is to consider inexact “prediction.” 
However, the treatment of inexact prediction will be clearer if 
exact determination is discussed rirst. 

The Exact Straight-line Relationship. Exact determination 
can be readily illustrated by the relationship between salary rate 
and annual earnings. Table 9.1 contains a listing of monthly 
salary rates and annual incomes for five employees. Once the 
salary rate for an employee is known, the annual income is fully 
determined, assuming, of course, that the employee works a full 
year at the same rate. These data are presented graphically in 

TABLE 9.1 Monthly Salary Rate and Annual Income of Five Employees 


Employee 

Monthly 
salary rate 

Annual 

income 

A 

$200 

$2400 

B 

225 

2700 

C 

250 

3000 

D 

300 

3600 

E 

400 

4S00 


Figure 9-1, where each point represents the paired monthly salary 
rate and annual income for one employee. The graph provides 
a ready means for ascertaining the annual income of an employee 
whose monthly salary is some amount other than those given in 
Table 9.1. 

To make a graph like that in Figure 9-1, horizontal and vertical 
scales for the two variables are drawn on coordinate paper. If no 
coordinate paper is at hand, a grid may be constructed by drawing 
horizontal and vertical lines at selected, usually equidistant, 
points on the two scales. Contrary to the advice given in Chapter 
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formula for a straight-line relationship 

4 these scales do not need to begin at zero. A point for each in¬ 
dividual is obtained in the way in which a point was located for 

a line graph in Chapter 4. 



* ’ w - W 

Fi „. Relation of Annual Inconto .0 M^thly Salary Rat. fa, Five Employee, A, 6, 

C, D, and E. 

The Formula for an Exact Straight-line Relationship. The 

Annual salary = 12 times monthly rate 

vivid to the e>e, , r “ ann ual salary and 

r oth'er letter ^Ld for "monthly rate,” and write 
a symbolic formula as ^ 

• i cuVwtitute Dairs of values from Table 9.1 
The reader may wish to substit P = 3000 aiK l 

in this formula. Thus for included in 

3000 = 12(250). Now cons.de tMxthjemp oy^ _ ^ 

Table 9.1. If X - 2 ’ 5 ' 2 "^ own ' in Figure 9-1 is a straight line. 
The graph of - degree, the graph of their 

When both and the relationship is called linear 

relationship is a straig term8 x and y are of first degree ; 

or sometimes rechlmea . , x-Y, XY\ and l 3 are 

X*. and XY are of between vari- 

of third degree.) <ormu a. • Wi u no t be considered in tins 

S.° f £^of'-h —- are railed nonlinear or 

curvilinear. of the linear relationship, suppose 

tha^eacl'of the’hvv employees listed in Table 9.1 reee.ves a year- 
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end bonus of §300 in addition to his monthly salary. The annual 
income Y then becomes 

Y = 300 + 12X 


It will be left as an exercise for the reader to develop a table 
and graph corresponding to the situation just described. This 
formula contains two variables, X and Y (quantities which may 
change from individual to individual), and two constants, 12 and 
300, which are the same for all individuals. 

The general expression for a formula whose graph is a straight 
line has the form 


( 0 . 1 ) 


Y = a + bX 


where X and Y arc variables changing from individual to in¬ 
dividual, and where a and b represent numbers which are constant 
for any one set of data. Formula (9.1) is a general algebraic expres¬ 
sion of the linear relationship between two variables. 

Independent and Dependent Variables. The usefulness of such 
a relat ionship as is expressed in formula (9.1) becomes clear when 
for one or more individuals the variable Y is unknown and the 
formula is used to determine the unknown Y from the known X. 
In obtaining the formula it was necessary to know values of both 
variables for a group of subjects but, once the relationship has 
been obtained, it can be used to determine Y for other subjects 
for whom only the values of A' are known. The variable whose 
values arc known is called the independent variable; the one whose 
values arc determined from the formula is called the dependent 
variable. 

In most practical situations, the logic of the situation makes 
clear which variable is considered independent and which de¬ 
pendent. Tims the familiar formula C = 2wr suggests that the 
radius of a circle is known or can be obtained by direct measure¬ 
ment, whereas the circumference is to be obtained not by direct 
measurement but by multiplication of the measured radius. Be¬ 
cause. (' = 2-r is an exact relationship, it may be algebraically 

C 

transformed into the equivalent form ?• = --• Here the inde- 

J7T 

pendent and dependent variables have interchanged their roles. 
The new formula suggests that the circumference of a circle is 
known or can be readily obtained by direct measurement and the 
Radius is to be ascertained from the circumference, as in obtaining 
the radius of a circular pipe or cylinder where only external meas- 
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bivariate frequency distribution 

. ic nn<$*ihlp When the relationship is inexact, the two vari - 

Cha in e exact Relationship. In statistical work t^-lat.onships 

the" d^usLn 1 01 ™^ relationships the mexact relationship will 

arithmetic and scon*on ^ of pairs o{ pcores for the 

columns in Table VII ■ ^ nQ simple relationship exists 

same students shows a c0( f c d 2 S, and 52 all 

between them. For examp e, • . tcgt ^ 3 ) but have the 

have the same score on ’ e “ te , t These differences may 
scores 58, 47, and 02 on the midterm tot. ^ scQr(j Qn the 

be accounted for by suc c °""‘ t cly the ability of a student, 

ability may be motivated differ- 

ently in doing their course "™ k ' k whet her in the midst of such 

The reader may be m H' ^to ask ^ ^ ^ ^ 

confusion any relations hip ,. ed in some detail, 

variables. This question wi construct the bivariate 

Bivariate Frequency D.stnbut.om^ ^ ^ ^ of Ap . 

frequency distribution s >°\ step interval is selected for 

Pendix Table VIII, „*% ,e ste P intervals for the independent 
each of the two variable.. . u for the dependent 

variable arc laid off on a horizontal ^ by plafing 

variable on a vertical axi-. drawing in the network of 

the axes on coordinate paper or by dialing 

horizontal and vertical line.-. represented by one 

Each of the 98 * "J u tnt with code number 1 who 

tally in Figure 9-2. Thus the ^ ^ scorc y = 58 is 

has arithmetic score X . n for whi c-h the interval on the lion- 

XaUcafe island the interval on the vertical scale is 55- 50. 
“d in 3^5'jt‘f-"-mg chapter, and mistakes in tabula- 
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tion are easily made. The task is less boring and results are 
usually more accurate if one person reads while another tabulates. 
Cardboard guides can be used by both reader and recorder to 
reduce errors and prevent eyestrain. The reader needs some device 
to keep his eye from jumping to the wrong row or wrong column 


Arithmetic T«t Score 



O' 

1 

i n 

• 

o 

15-19 

a 

• 

o 

CN 

25-29 

30-34 

35-39 

40-44 

fy 

70-74 







/ 

/ 

2 

65-69 






/ 

/ 


2 

60-64 





HII 

III 

mm 

mi 

21 

. 55-59 

o 




/ 

m 

Mil 

m 

III 

19 

i 50-54 
«• 

£ 




/ , 

mi 

in 

MJU 


18 

r* 

g 45-49 

£ 

•u 

i 

/ i 


/ 

II 

Ml 

mi 

n 

II 

18 

^ 40-44 


/ 

II 

1 

hi 

III 



10 

35-39 



1 

I 

i 

1 



4 

30-34 





i 

1 



2 

25-29 




H 





2 

f* 

i 

i 

4 

6 

24 

23 

27 

10 

98 


Fig. 9-2 Midterm Test Scores Shown in Relation to Arithmetic Test Scores (98 cases). 
(Data from Appendix Table VIII.) 

on the data sheet, and to keep him from reading the same values 
twice or omitting others. The recorder will find it an advantage 
to prepare a strip of cardboard with an exact duplicate of the scale 
for the midterm test. As the arithmetic score is read, the recorder 
moves this strip so that the scale appears directly at the left of 
the appropriate column, with zero point properly aligned. It is 
a simple matter then to place a tally in the proper cell. 

When a tally has been recorded for every individual, the mar¬ 
ginal frequencies are obtained by adding across each row and down 
each column. 

1 he tallies jointly make up a bivariate frequency distribution , 
also called a scatter diagram and a correlation chart. The set of 
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marginal frequencies at the right of this figure obtained by adding 
across the rows, is the frequency distribution for the midterm test 
scores; and as we are using the letter Y to denote these scores 
these frequencies are labeled /„ The set of frequences at the 
bottom o the figure, obtained by adding down the columns, are 
fr equencies related to intervals of the X scale and so are 
labeled /. Frequencies in the interior of the figure represent he 
joint occurrence of a particular value of X and a particular value 

nf Y and so could be called fxy . 

A cursory examination of the bivariate distribution shows that 

high scores fn the arithmetic test tend to be associated with high 
scores in the midterm test, and low scores in the arithmetic test 
with low scores in the midterm. Because of this immediate evi¬ 
dence of relationship, one may speak of the two variables as being 
co-related or correlated. A measure of correlation will be described 

“ ’S.THSrS Miniature. Beta, U. «*. * 

in a study of the relationship between scores on the two tests, 

working out that relationship for a very small number of students 

mav serve to clarify principles. Nine students have been chosen 

Their code numbers and scores on the two tests have >ecn ret oh < t 

in he first three columns of Table 9.2. The remainder of that 

table shows computations based on these scores for the purpose 

ensuing discussion before attempting to follow these compu - 
tl0n in Figure 9-3 the nine pairs of scores have been plotted as a 

sisssiig 

iSSfsSSSHS 

This line is to be p ac “best fit”? Obviously, no 

nine points. But what 1! > m ., , a j| t |, e points will 

matter where the line is irawn in , ()(j R ( ii s ,. re p a ncv be- 

deviate from it to some extent and the 1 (wlli( ,|' , ms been 

tween the actual midterm score of an m« 

denoted V). «m| ^^ 3 f s ” F tilde,” ”esti- 

Egression V”). H the relation 
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TABLE 9.2 Computation of Regression Equation to Predict Midterm Test 
Score from Arithmetic Test Score for Nine Cases from the 

Data in Table VIII 


Code 

number 

of 

student 


X Y 

Arithmetic Midterm 
score score 


X* 


Y 2 


XT' 


? Y-? ( Y-?y 


10 

37 

44 

1369 

1936 

1628 

55.2 

-11.2 

20 

17 

37 

2S9 

1369 

629 

44.5 

—7.5 

30 

25 

52 

625 

2704 

1300 

4S.8 

3.2 

40 

32 

58 

1024 

3364 

1856 

52.5 

5.5 

50 

IS 

47 

324 

2209 

846 

45.0 

2.0 

60 

25 

52 

625 

2704 

1300 

48.8 

3.2 

70 

27 

50 

729 

2500 

1350 

49.9 

0.1 

so 

36 

53 

1296 

2S09 

1908 

54.7 

-1.7 

90 

36 

61 

1296 

3721 

2196 

54.7 

6.3 

Sum 

253 

454 

7,577 

23,316 

13,013 

454.1 

- 0.1 

T 253 

no i 

9(13, 

,013) - 

(253) (454) =on 




= Jo.l 

6 ' 91 

[7,577) - 

- (253)* 

- ivwu 



^3 

II 

1 

= 50.4 

a = 50.4 

- (.539)(28.1) = 

35.3 




125.44 

56.25 

10.24 

30.25 
4.00 

10.24 

.01 

2.89 

39.69 


f = 35.3 + -539X 


were perfect , every point would lie on the line and Y — F would be 
zero for every subject. In any reasonably good position of the 
line, some of these discrepancies will be positive and some negative, 
and so it is not enough to make the sum of all discrepancies zero. 
In fact, the sum of discrepancies from the horizontal line through 
the mean of the Y scale would be zero, yet that line does not repre¬ 
sent the trend of the data at all well. The criterion commonly 
adopted for good fit is to make the sum of the squares of the dis¬ 
crepancies {Y - Y) as small as possible. This is the principle or 
the method called least squares. By it the numbers a and b in 
the formula 


? = a + bX 


are computed. We arc now writing V instead of Y because the 
relationship is between the known arithmetic score (= X) and the 
best estimate which can be made of the midterm score (= V), not 
the actual midterm score (= Y). By calculus it can be proved 
that the values of b and a which give the best position of the line — 
that is, “best” by the criterion stated—can be computed by 
formulas (9. 2) and (9.3). 
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Fia 9-3 Midterm Test Scores Shown m Ke.anon 
Z! (Dotled line, pan through mean, of In. two variable,) 


(9.2) 


NZXY - ( SX)(SV) 
b ' NIX- - (2X) 1 


(9.3) 


a = 7 - 

T m tin q 9 thp sums of the columns headed X, > , X 2 , and X) 

— b. .*..«;«» i«-« <»»'"< 

(9.3). The result is the formula to predict Y Iron . ■ 

? = 35.3 + -54X 

this formula to predict the Y score of the 

J “ « Et ° m >» * A< v 

IS 37, we have ^ = ^ + 54(37) . 55 . 2 

Tte v „« r “ bi c 

“cr - "■ - -.. 

stand the use of' J^f the nine students, the point 

If you shouldI plot, you would have nine points lying 

rS ^ that line would pass through the point 




REGRESSION 


128 

whose paired values are X and 7 . Therefore, you could locate 
this line by plotting any two of these points. 

Figure 9-4 shows the regression line superimposed on the scat¬ 
ter diagram of Figure 9—3, and also shows for each of the nine sub¬ 
jects the amount (Y - ?) by which the regression value was in 
error. The discrepancy Y - ? is a deviation from the regression 
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Fig. 9-4 Regression Line Y = 35.3 -f .54X and Errors of Estimate for the Nine Cases 
of Table 9.2. 


line sometimes called an error of estimate , an error of prediction , 
and sometimes a residual error . The reader should check some of 
these discrepancies and compare them with the corresponding 
vertical lines in Figure 9-4. 

For historical reasons which will be explained on page 137, the 
line drawn on Figure 9-4 is called a regression line and the- value of 
b in the equation is called the regression coefficient. 

Before leaving Table 9.2 we should take notice of certain re¬ 
lations among its figures: 

(a) The sum of predicted values of Y should equal the sum 
of actual values except for rounding errors. Here we have 2 Y = 
454 and 2T = 454.1. 

(b) Because of the relationship just stated, the sum of the 
errors of estimate should approximate zero. Here we have 

2(K- f) = -0.1. 
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CHECKS ON COMPUTATION 

A General Routine of Computation for the Regression Formula. 

The procedures illustrated for the 9 cases will now be applied to 

all 98 cases of Table VIII. It is unnecessary to copy do\ui the 

• To hip Q 2 but sums can be obtained directly on a corn- 
scores as in Table 9.A out sums . , needed 

puling machine. With a wide carnage Monroe, all the 

* A 1 in Table 9 3. In subsequent steps in that ta e 

entered as Step 1 in laoieu.o reader 

the computations are shown m convenient form. 

"“'rrr —- - 

found to be 

f-31.6 + .66X 

whereas for the 9 selected cases it was 

f = 35.3 + MX 

«. • nf f or the 98 scores is .06 and for the 9 

The regression coefficient now taUe another small 

selected scores is .54. I computations, he will probably 

sample and carry out t different from these. If, for 

get an equation with cons • • Table VIII, he would 

i« .houiduMtbe ,„,i r. 

find ZX = 316, 2K = 504, 2X -.11390, coefficients 

48.1 + .22X. These dl ^“rate tL v^ion which results 
of the regression formula aspects of the problem of 

from using different samp^ ^ in chapl ers 13 through 16. 

sampling varmbility wi d ^ & computing machine is 

Checks on o{ ^ sums entering into the re- 

used, a check on the accu . next chaptcr yo u will see 

gression formula is essen la . comP uting a correlation coeffi- 

that these same sums are use satisfactory because 

cient.) Recomputing is i„ Table 9.4 there 

a computer may make thesam proof verification of the 

is illustrated one way of obtaim k ^ ^ x _ y are written 
correctness of these sums. - cr gums 0 f squa res can be 

down for each individual, the s • , d wou fd be laborious 

readily obtained on a machine. 1 he 

for hand work. Ncw York; 

. See Pease, Kalliarieo. Hackin' Common of Elen 
Chartwell House, Inc., 1949. 
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TABLE 9.3 Steps in the Computation of the Regression Formula with 
Application to the Data of Table VIII in the Appendix 

X = Arithmetic Score, Y = Midterm Test Score, N = 98 



The Conditional Distribution of Y for a Given X. In Figure 
9-5, the graph of the regression equation has been drawn on the 
bivariate frequency distribution for the 98 cases, a step which 
could not be taken until the computations of Table 9.3 were made. 
Each vertical column contains a frequency distribution of the mid¬ 
term scores, Y, of a group of students who had arithmetic scores, 
A', so similar that they fall in the same X interval. For each of 
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Arithmetic Test Score 



e , g - une of Regression of Midterm Test Score on 

Ab-e cod Be,ow -be Re 9 ,e M ,on 

. Dots indicote position of column meor». 
x Crosses indicate means of three sect.ons. 


Arithmetic Score with Parallel 
Line 198 cases). 


- ^^sSSSlSXS^tSSSSi 

considerable irregularity. frequencies are much 

Now try to uMffje ‘ » «« distrihut q ion {or each value 

larger. There will st‘ll be a d^re thttt X has a 

of X, and this is the distribution of y If N cou ld 

particular value, or the w* 10 ” 0 that the mea „ 0 f this con- 
be made very large, we ™g P in one sense 

Consequently, it seems re stan ,l ar d deviation in a very large 
tributes would havethe^scores in the entire dis¬ 
union TheTommon standard deviation thus obtained belongs 
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TABLE 9.4 Checks on Computation of Sums Needed for Regression and 

Correlation, Applied to the Data of Table 9.2 



Code 

number 



10 

20 

30 

40 

50 

60 

70 

80 

90 

Sum 

Sum of 
squares 


37 

44 

17 

37 

25 

52 

32 

58 

18 

47 

25 

52 

27 

50 

36 

53 

36 

61 


253 454 

7,577 23,316 


X + Y X-Y 


81 

- 7 

54 

- 20 

77 

- 27 

90 

- 26 

65 

- 29 

77 

- 27 

77 

- 23 

89 

- 17 

97 

- 25 

707 

- 201 

56,919 

4,867 


2 X + 2F = 2(X + Y) 

2X - 2F = 2(A* - Y) 

2(X + 1’) + 2(A' - F) = 22X 
2(A + F) - 2(X - F) = 22F 
2J(X + F)» + 2(A r - F) 1 = 2(2A’ S + 2F>) 

2(X + F)* - 2(X - FT* = 42XF 

2(X + F) 1 = 2X 1 + 22 XF + 2F* 


253 + 454 = 707 
253 - 454 = - 201 
707 - 201 = 506 = 2(253) 
707 + 201 = 90S = 2(454) 
56,919 + 4,867 = 61,786 and 
2(7,577 + 23,316) = 61,786 
56,919 - 4,867 = 52,052 
4(13,013) = 52,052 
56,919 = 7,577 + 23,316 + 
2(13,013) 



equally to the Y distribution for each separate X , is the standard 
deviation of the conditional distribution of Y, and is equal to 



The symbol s v . x is read “s sub y point x” or “s y point x.” This 
value could be obtained by computing ? for each case in the dis¬ 
tribution and subtracting it from Y but is much more easily ob¬ 
tained by first computing 


(9.5) S (F-?) 2 = 


N 


mY * - (2 YY - C NZXY-VX)Vm 

A 2,1 -(21) NXX 2 - (XX)* 


and substituting the result in (9.4). 

Because s u . x measures the extent to which actual scores depart 
from the regression line, this standard deviation is a measure of 
the errors one would make in estimating or predicting the score 
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gain of INFORMATION 

• 1- • 1 i Kv n«p of f. For this reason s u . z is railed the 
S(a On r Figure ‘- 

W ' t Variat'ion Ibo^the Regression Line. Mutations near the 

S rri 

s„ = 9.3. Common closely t ) ian the mean 

line describes the trend of the 1 scores morei ^ ghouM be 

does, the variation of scor ^ ro sc about the general mean 
less than the variation of those . a i r deviations of 

of the table. In fact U.e sum of Urn scjuarr of ^ ^ _ of 

scores from the mean, -( deviations of scores from 

5sr.*l .<—* ..— 

2(f -7) 2 . In symbols, __ 

(9 6) v 0 --rr = 2 (V-f) ! + 2 (r-lT 

To verify tills relation, return now to Table 9.2. Compute 

S(r _ Yy - sr* - ^ = 414.2. Compute 2(f - T)- by sub- 

i f ii. n q values of ^ } squaring the 
a- V 4 from each ot tlie J > aiUL * ’ 1 , 

resuRis .3,8. £*£+ 
; 0Un,1 iS - 4.4,. Rounding 

~ - “* r«f“ !■” «ar- rr. 

By formula (O.h), i j . tfnnwledee of X. In order 

Gain of Information about Y from Knowledge f “ (| - x , iduals is 

to ascertain what m puna ion a scores, consider lirst the 

provided by informationabout , A * In this 

situation when knowledge of A • thc y dis- 

.— 

-■y^l rs 

and »« S.O. u ‘^ f whom the arithmetic score 

Therefore, m general, foi *ununi 
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is 10, nearly all the midterm scores would presumably be below 
54.2, a score only slightly higher than the mean score for the total 


S Consider now the conditional distribution for X -40. Here 
? = 58.2 and so 7 - 2 S „, = 42.2 and f + 2s,., = 74.2 We would, 
therefore, expect nearly-all students for whom the arithmetic 
score is 40 to be above the mean of the total group. In addition 
the comparison of the distribution for X - 10 and for X = 40 
suggests that the latter is so much higher than the former that 

groups having these X scores differ greatly. _ 

These considerations show that knowledge of X provides much 
more information than would be available without it. Knowledge 
of the X scores of individuals is helpful in making selections for 
employment, or for school, when the goal is high achievement on 


certain Y scores. 

Applications of the Regression Formula. The concept of con¬ 
ditional distribution of Y for a given X can now be used in the 
problem of estimation or prediction which was raised at the be¬ 
ginning of this chapter. The problem raised there was, “Given a 
value of X for an individual, what is the best estimate of Y for the 


same individual?” . 

The concept of conditional distribution states that Y for a 

given X is the best estimate of this Y mean of individuals with that 

same value of X. For this reason it is a best estimate of the Y score 


of an individual with given X. 

The concept of conditional distribution is useful beyond mere 
estimation since it provides a method of placing an individual with 
respect to an average, or expectation. Thus an individual with a 
given X who scores higher than the corresponding ? exceeds his 
expectation, and he falls below his expectation if his score is below 
the appropriate P. 

To illustrate these concepts consider the regression of midterm 
scores on scores in arithmetic which was computed in Table 9.3 as 
Y = 31.6 + .66A'. Suppose an individual has an X score of 33, as is 
the case of Student 1 in Appendix table VIII. By the regression 
formula the best estimate of the corresponding Y score is 53.4. 
Since his actual score was 58, he did better than he might have been 


expected to do in view of his mathematical ability^. 

A similar analysis shows that Student 2 performed more poorly 


than might have been expected. 

Let us now apply the regression formula to the study of the 
groups of students in sections I, II, and III. For each of these 
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groups we can compute the mean of the arithmetic test scores. 
Call this mean X,. From X* a mean of estimated values 
namely, — can be computed by the formula 


(9.7) 


7 ,• = a + bXi 


srsrs s=- 

F = 31.6 + .66^ 


Student 

group 


8ection I 
Section II 
Section III 
All Students 


Mean of arithmetic 
test scores 

J 

35.85 
30.07 
20.32 
31.38 


Mean of estimated 
midterm test scores 

F 

55.20 
51.44 
49.0 
52.3 


Mean of actual 
midterm test scores 

7 

55.05 
52.78 
48.18 
52.40 


... 0 f actual scores is impressive, 

estimated scores agree wi ^ {or in dividuals. One might 

This approximation is far g sections performed 

conclude, therefore, that relate ability 

on the midterm test very ne y ™ recm cnt between means 

as shown on the arithmetic test. shown a l so in 

oi «... -I “i”'. ***.. — 

Figure 9-5, where means of actu 


crosses. . . .. Depression Coefficient. The 

Graphic Representation of the R 8^ twQ mcans X and 7, 

regression coefficient j>» t0 K e * (Geometrically, b is the 

fully determine the line o ta j cc any two points on the 

do-pc of the line of regression. .. tliroug h the lower and 

line of regression, draw a ''“-^ '^ 0 ^ 0,0 these lines 

a vertical line through the upperP? "X med. The ratio of the 

horizontal is the ** 4 ^ " <’ 


vertical segment 
6 = horizontal segment 


v 

h 


(9.8) 
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Figure 9-6 shows two regression lines and two such triangles 
drawn on each. For the left-hand figure, both h and v are consid¬ 
ered positive, so b is positive and the line has positive slope. When 
b is positive, the regression line is directed in a general southwest 
to northeast direction. For the right-hand figure, v is considered 




Fig. 9-6 Geometric Representation of the Slope of a Regression Line, b =*= -• 

positive and h negative, so the quotient v/h is negative and the 
line has negative slope. When b is negative, the regression line 
has a general northwest to southeast direction. Readers who are 
unfamiliar with directed lines may find help in Chapter 8 of 
Walker, Mathematics Essential for Elementary Statistics. A 
further discussion of the slope of a line is found in Chapter 10 of 
the same book. 

The Assumption of Linearity. The discussion in this chapter 
has restricted itself to fitting a straight line to a set of data and 
the reader is no doubt thinking of situations in which the trend 
is not linear. The graphs in Figure 9-7 are interesting in this 
respect. These curves were fitted by Henry A. Ruger to data 
originally collected by Sir Francis Galton, and show for about 
7000 adult males the way in which certain physical measures 
change with age. Each of the little black dots shows the mean 
value of the measure for a group of men of the specified age. Con¬ 
sider the first of these, showing the trend of visual acuity. Ap¬ 
parently if we had a group with age range from about 20 to about 
40 years, the regression would be nearly linear and with a slightly 
negative slope indicating a slow decrease in visual acuity with 
advancing age. But if the age range were from about 20 to 80 
years, no straight line would fit the data very well. 

A discussion of nonlinear curves is beyond the scope of this 
book. A more or less intuitive decision as to the appropriateness 
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of fitting a straight line can be made from a scatter diagram in 
which there has been some grouping. If the means of the columns 
cluster about a straight line as they do in Figure 9-5, then a linear 
relationship may be presumed and a straight line should be fitted. 

Oriein of the Term “Regression.” Most people think 
gression equation” and “regression coefficient” rather queer terms 
unless they know the peculiar situation in which they were first 
used. Francis Gallon, being profoundly impressed by Onsin o/ 
the Species which his cousin Charles Darwin published in 1859, 
set himself to explore various aspects of heredity. In the process 
he gathered data on the stature of parents and their adult offspring 
and plotted a bivariate frequency distribution in which each entr\ 
represented one family, one axis being scaled for the average height 
of the two parents and the other for the average height o all the 
adult offspring. As might be anticipated, this chart showed clear > 
that taller parents tended to have taller children than did shorter 
parents. However, Gallon also noted the surprising resu t that 
the children of very tall parents were, on the average, not quite 
so tall as their parents, while the children of very short parents 
were, on the average, not quite so short as their parents In othei 
words, extreme parents have children more mediocre t an them¬ 
selves. What is it, pondered Gallon, which is dragging he human 
ace b^k toward mediocrity? He commented that, “However 
paradoxical it may appear at first, it is theoretically a necessary 
fact and one that is clearly confirmed by observation, that the 
stature of the adult offspring must on the whole, be mcie ™dmcrc 
than the stature of the parents; that is to say, more near the i 
rtlie median] of the general population.” In 1885 Gallon pub¬ 
lished his famous paper on “Regression toward Mediocrity in 
Hereditary Stature” and introduced the term Unes oj regression.' 

Thp term still holds. , . 

Not only is historical interest and clarification of meaning to 

he found in Gallon’s use of the term regression but there is also an 
important corrective for a certain kind of loose thinking. 1 cop e 

. 1 * — 

of My Life, I/melon- _ letters and labours of Francis Gallon, 3 vols., 

Hiomelrika, 13 O‘J20), 25-45, Hel( , n m. Walker, Studies in the History of 

Cambridge University ress, , • j. « j ()2 9 The original work* by Francis 

Statistical Method, The N ^^^^Coditary stature," Journal of the 
Galton are. IU-Krc».**io i 246-263; “Family likeness in stature, Iro- 

A’dLropoloyical histitut* ( j, -Co-relations and their measurement, 

oj Pie ** son,, 4. (.»». • » .45; 

and Natural Inheritance, 1889. 
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EXERCISE 9.1 

1. Examine each of the curves in Figure 9-7 and describe relationships 
which seem to you to be nearly linear, stating for each the age range 
over which linearity appears to hold. Describe situations in which a 
relationship appears not to be linear. Interpret what you see in terms 
of the way physical change appears to be related to increasing age. Do 
you find any variable which cannot be estimated from age? 

2. (a) From the data of Table VIII, the student with code number 30 has 

the scores X = 25, Y - 52. Is his Y value above or below what 
you would estimate by the regression equation Y = 31.6 + .66X? 
By how much? 

(b) Answer the same questions for student number 68 for whom 
X = 39 and Y = 44. 

3. Is there any fallacy in the comment that the general level of intelligence 
appears to be rising because in several families where the parents 
are of very low intelligence the children are considerably brighter 

than their parents? 
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„as studied fro ™ *| ,e [n the present thapter, this tela- 

the variation about t • 1 t of the correlation 
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vidmg a measure of 1 P and on the other hand, as providing 
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a measure of the muuiciiivy Dre cedine chapter, prc- 

The Correlation Coefficient ^preced , vhen 

diction was achieved y m Iine prediction is exact and 

all actual values are on the ? perfect. We shall 
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is zero when prediction is exact, for then every Y - ? is zero 
and so the numerator is zero. When knowledge of X contributes 
no information about Y, the numerator of the ratio is equal to 
the denominator and the value of the ratio is one. It should be 
noted that the numerator represents the sum of squares of the 
deviations of Y values from the regression line, while the de¬ 
nominator represents the sum of squares of deviations from the 
mean; and the numerator will never be larger than the denomi¬ 
nator. . . 

This ratio could serve as a measure of precision of regression 

estimates but, for reasons which are partly historical, the closely 
related coefficient of correlation is more commonly used for that 
purpose. The correlation coefficient, commonly denoted r, will 
now be defined by four algebraically equivalent formulas, each 
of which serves to reveal certain important aspects of this 

measure. 

First, a formula which relates r to the sum of squares of 


residuals is 
(10.1a) 


z(r- ?) g 

2(K-7) 2 


From this formula it can be seen at once that, when prediction is 
exact, r- = 1 because the ratio 


S(K- f ) 2 

2 (Y - 7) 2 


is 0; and when prediction fails entirely, r 2 = 0 because the ratio 
is 1. In other words, r 2 = 0 means no correlation and r- = 1 
means perfect correlation. It can also be seen that r 2 can never 
be larger than 1 because the ratio cannot be negative. Thus r 
cannot be numerically larger than 1 but can have values between 
-1 and +1. This formula gives no clue as to whether correlation 
is positive or negative. It does not provide a convenient pattern 
of computation. 

As correlation is a mutual relationship between two variables, 
you are probably thinking that if X had been predicted from 
Y, r should also be related to the precision of that prediction. 
You are correct, and the formula 


(10.1b) 


S (X - X) 2 

s(x - Xy 


is algebraically identical with (10.1a). 



the correlation coefficient 

The best-known formula for the correlation coefficient is 
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(10.2a) 


r = 


Ti(X-X)lY-Y)_ 


which becomes more vivid to the eye and 
remember when x and y are substituted for X X 

Then 


(10.2b) 


r = 






Obviously, this formula has the numerate^ as the formula 

for the regression coefficient. i therefore r and b must 

and the denominator l ° T ^ S ^ ion Une has positive slope, 

have the same sign. Whe g tpndencv for large values of 

r and 6 are both positive, and there is * t* ndencytor g 

the two variables to occur slope , r and 5 

together. When the regre , { large va lues of one 

are both negative and there « a tendency for^g^ ^ ^ 

variable to occur in connect provides a con- 

Neither formula (10A nor formula ^ ^ provides 

' venient routine for compu a • era bly longer and looks 

the easiest routine for formula (10.2b). 

JESlrSfb.Shed from tom*. d« * ■» 
manipulation: 

MT.XY- (ZX )(ZY) _ 

(10.3a) r = 

It'wUl'b^used 

sections of this chapter. . terms 0 f coded devia- 

Formula (10.3a) may e ^ or igin instead of gross scores 
tions (*' and y') from an ar 1 rary f ula remaining un- 

(X and Y), the general pattern oi 
changed. The formula is then written as 

2 ====== 

(io.3b) r = } 1 

This •» w. 

1‘n'tad.ted.rriU. -i—' • -*-> 
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A fourth formula equivalent to those already presented shows 
the relationship between the correlation and regression coefficients: 

(10.4a) r " b | 

Since s„ and s, are both positive, the formula shows once more 
that r and b must have the same sign. A formula equivalent to 

formula (10.4a) is 

(10.4b) 6 - r Z 

When the two variables have equal variability so s„ = s„ the 
correlation coefficient r is identical with the regression coefficient b. 
This was the situation in Francis Galton’s study of hereditary 
stature. He used the letter r for the regression coefficient, but 
later it became almost universally used for the correlation co¬ 
efficient instead of for the regression coefficient. We have already 
seen that r cannot be numerically larger than 1. There is, how¬ 
ever, no such limitation on the size of b. If, for example, r = .8, 

s x = 3 and s„ = 12, b would be 3.2. 

Computation of the Correlation Coefficient without Plotting. 
The computation of the correlation coefficient is very similar to 
that of the regression coefficient and the two are usually obtained 
together. For a first illustration, let us turn to the data of Table 9.2 
on page 126, where we already have 2(F - F) 2 = 279.01 and 

2 (Y - 7) 2 = 23,316 - = 414.22. Therefore, by formula (10.1) 



279.01 

414.22 


.326 


and r is either V.326 = 4- .571 or — V^326 = — .571. Whether 
r is positive or negative we cannot tell from formula (10.1). 

From Table 9.2 we also know that N = 9, 2 AT = 13,013, 
2A = 253, 2F = 454, 2X 2 = 7,577, and 2F 2 = 23,316, and there¬ 
fore by formula (10.3) we have 

9(13,013) - (253)(454)__ KV1 

r “ V[9(7,577) - (253) ! ][9(23,310) - (454) 2 ] 

Although the two procedures are algebraically identical, formula 
(10.3) is better to use because it. involves easier computation and 
less rounding error and because it indicates whether r is positive 
or negative. 
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To oo„p««. U» k“'¥£. M 

NXXY - (2X)(SF) = 331,437 

\_NZX 2 - (XXyJ.NXY 2 - (SF) 2 ] = (499,313) (819,11^^^^ 

- vW T^ " 639 ' 52G 

r - vpvsx 2 ’ 
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there was only one r ®8 r ® ssl ° prediction makes much 
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formance on a job from performance. How- 
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ever, in other situations thc .TA predict X from Y as to predict 
is just as important to be able P da ta for Appendix 

Y from X. Suppose, for ^^^'^^Tpupils appear. One of 
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them has taken only t e o{ b j s score in arithmetic 

teacher wants to obtain an The secon d has taken only 

reasoning by a regression equation. n equation u needed 

the arithmetic reasoning test ant b taUon score would have 

to estimate what his anthroetic ^ nothing whate ver 

been if he had taken that ^' * e regress ion equation is more 
in this situation to sugges-that one>8 

important or more meamngfu ^ ^ liave been using will 

The equation ? =■ o ■+ h sub script to each of the 

now be changed slightly by attaun g 

constants, making it 

(10 5 t ? = a v* + 

The analogous equation to estimate X from Y is 

X = dxu + 

(1 °‘ 5b) • subscript is the same as the order of 

The order of letters in the s • P in the subscript names 

variables in the equation i i * d letter in the subscript 

the variable to be estima cc , j estimation is made, 

names the variable by means of which 
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TABLE 10.1 Steps in the Computation of Correlation and Regression 

Coefficients from a Scatter Diagram 


Step 


Procedure 


1 


r> 


Plot the data on a bivariate frequency chart. This will ordinarily be done 
by tallies as in Figure 9-2 on page 124, and the result will be confusing to work 
from. If the number of cases is at all large, the tallies should be replaced by 
numerals before the next steps are taken. The symbol /* v is used to represent 
the number of cases in a cell at the intersection of a row and a column. 

Add across each horizontal row to obtain the marginal frequency for that row 
and record the result at the right in a column labeled f v . 

Add down each vertical column to obtain the marginal frequency for that 
column and record the result below the chart in the row labeled /*. 

Find 2/„ and 2/ x and verify that they are equal. Record that value as N. In 
Figure 10-1, N — 98. 

In the column labeled y\ record an arbitrary set of coded scores for the Y inter¬ 
vals. In the row headed x\ record an arbitrary set of coded scores for the X 
intervals. If you are using a machine, it is convenient to place the arbitrary 
origin (coded 0) in the lowest interval in which a frequency occurs as in Fig¬ 
ure 10-1, because then all deviation values will be positive. When working 
by hand, some computers prefer to place the arbitrary origin nearer the center 
of the distribution in order to have smaller numbers to work with. Then special 
attention must be paid to signs. 

Compute 2/ v j/' and 2/„(y') t exactly as you obtained similar values when com¬ 
puting a standard deviation. In Figure 10—1, these are 497 and 2849, respectively, 
and for ease of identification have been designated^ .4 and B. 

Turn the table around and compute 2/ x z' and 2/ x ( x'Y in the same way. In 
Figure 10-1, these arc 479 and 2551, respectively, and can be identified as C and E. 

For each row separately, compute 2/ X|r r' and enter the result in the column 
headed 2x\ For example, the computation for the third row from the top is 
4(1) + 3(5) + 10(6) + 4(7) = 119. The sum of that column must agree with 
the 2/ x x' obtained in step 4 and will thus furnish a check on the correctness of 
the preceding step. In the example, the sum of this column is 479 and has been 
marked C to show its relation to the sum of the row C. In practice it is helpful 
to make a small movable scale exactly like the fixed horizontal scale of the corre¬ 
lation chart, with code numbers inserted, and to place this scale immediately 
below each row so you can easily multiply each by the corresponding x' and 
sum the products. Turn the table around and, in similar manner, find 2/ x ,y' 
for each column and enter the results below the table in the row headed 2t/'. 
The sum of entries in this row must be identical with that previously obtained 
for 2/,!/'. In the example, both sums are 497. 

In the column 2z', multiply each entry by the corresponding y' and record in 
the final column of the table headed i/'2x'. The first entry in the column 2x' is 
13, actually obtained as 1(6) 4- 1(7). When this is multiplied by y' = 9, wc 
have 9(13) = 117 and 1(6)(9) + 1 (7)(9) = 117, which is the value of 2 / x *z't/' 
for the two cases in the top row of the table. The sum of all the entries in this 
final column will be Zf lv x'y' for the entire table. Turn the table around. Mul¬ 
tiply each entry in the row 2?/' by the corresponding x' and enter in the final 
row of the tabic. The sum of the entries in this row will also be 2/ x ^r'y' and 
thus provides a complete check on that part of the computation. In the example, 
these two numbers have been labeled D. 


Complete the computation of X, F, s x , s„, r. 
sums into the appropriate formulas. 


b tr or 6 X * by inserting the obtained 


The regression coefficient b ux is already familiar, but its formula 
will be repeated here in order that the similarity of form of b vx 
and b xv may be recognized: 

N2XY - (SX)(2F) 

MSA'* - (2A') 2 


(10.Ga) 
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the second line of regression 


N'LXY - (S X)(S10 

bxy - - N 2 Y 2 _ ( 27 )* 


(10.6b) 

Not , ,u, u»«. “5 

in the formulas 

(10.7) a vx = y - h v*% and 


a 


= Y- b„Y 


Arithmetic Score 
«2 


fyy ' 

f ,( y ') 2 

Zx' 

y ' Xx 1 

16 

162 

13 

117 

16 

128 

II 

88 

147 

1029 

119 

833 

114 

684 

101 

606 

90 

450 

90 

450 

72 

288 

78 

312 

30 

90 

42 

126 

8 

16 

14 

28 



K 


7 = 27 + 5(W) = 52 36 
7 - 7 + 5(W> - 31.44 

^ = 84.66 *» “ 

3 - 54.06 ** = 74 

5 13,503 

**** “ 5 * 20,557 
5 13,503 

b *» “ 5 32,193 


.66 

.42 


N2*y = 98(2567)-(4971(479)-13.503 
N2y» = 98(2849)-(497)’ = 32,193 
N2 x* = 98(2551) - (479) - 20,557 
(32,193)(20,557) = 661,791,501 

1 3,503 = 13.503 = 53 

r “ V66L79h50i 25 < 725 


Fi S . 10-1. Computation from a Scatter Diagram. 
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Computation of the Correlation and Regression Coefficients 
from a Scatter Diagram. In the previous chapter, we considered 
one method for obtaining the sums of squares and sums of 
products needed in the regression formula, and these same sums 
are the values needed in the correlation formula. The method 
described there is convenient to use when data are given in a 
list such as the lists of Appendix Tables VIII and IX and when a 
computing machine is available. If no machine is at hand and 
N is large, that method would be very time consuming. It would 
be difficult to use when data are presented in a scatter diagram 
such as that of Figure 10-1. We shall now consider a method 
of computation more appropriate for data displayed in a scatter 
diagram. This method provides a complete check on computation 
of Z,Y, ST, and 2 XY but does not check the computation of 2X 2 

and ST 2 . . 

Figure 10-1 shows a computation for the correlation coefficient 

between midterm test scores (1) of 98 students in a first course 
in statistical method and scores on an arithmetic test taken at 
the beginning of the term. Table 10.1 describes in words the 
steps in that process. You should examine the two tables to¬ 
gether, identifying in Figure 10-1 the outcome of each step 
described in Table 10.1. 

In Figure 10-1 it happens that i, = i v . Usually these two 
step intervals are not equal and care must be taken to multiply 
by the proper numbers in computing regression coefficients. A 
summary of computation formulas using the letters A, B , C , 
D, and E to represent the results of specific operations performed 

in Figure 10-1 may be helpful. 


NE - C- 


(10.8) 

s z = i z \ 

N{N - 1). 

(10.9) 

s„ = i, \ 

/ NB - A 2 
' ;Y(.V - 1) 

(10.10) 

y — _ 

> 

1 

l 

22 

^ 4 

?> 

► ». 

i 

•< 

> 

i 

(10.11) 

b -is. 

ND - AC 

Dyj - . 

A E - r- 

(10.12) 

, ir 

b zu - v- • 

l v 

XI) - AC 

NB - A- 



EXERCISE 


149 


EXERCISE 10.1 


1. Using the bivariate distribution of Figure 10-1, place the arbitrary 
origins at A' x = 27 and A' u = 57 and verify the following: 

XU = 87 2/„y' = - 91 = 57 

S/^x') 2 = 287 2/ v (y') 2 = 413 

' Complete the computations until you obtain the same values of r and 

b v , as are found in Figure 10-1. . . . 

2. With the same frequency distribution and with arbitrary origins at 

A' x = 32 and A\ = 02, carry out the same computations. 

3 Verify the values of r indicated below Figures 10-2 to 10-7. 

4 *. Find r and both regression equations if preliminary computations 

have yielded the following results: 


N = 250 
A x = 15 
Ay,’ = 50 
ix = 3 
ty = 5 


= 160 

Xi/ = ~ 125 

2 (*T = 812 

2(y') 2 = 914 
2xV = - 34 


0 t 2 


12 

II 

10 

• 

? 

• 4 - 


1 

4 

5 

2 
I 
• 


ID II 12 /y 





• 21 5/ 112 101 * 02 21 
r = 0 


0 1 2 2 4 8 6 7 I 0 10 11 12 



r~. 75 


| I 2 I 4 0 « 7 I • 10 11 12 /„ 


f I 2 I 4 I • 2 8 • 10 11 12 /j 



2 

7 

II 

68 

112 

110 

•7 

IT 

20 

7 

2 

600 



r = .92 

Fig. 10-2 Scatter Diagram, and R.gr.„ioo line, far Faar Di.tribalian, with Different 
Size* of r. 
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Scatter Diagrams Illustrating Various Values of the Correla¬ 
tion Coefficient. No one can judge the size of a correlation co¬ 
efficient precisely from the visual appearance of a scatter diagram, 
but persons who work extensively in correlational material acquire 
a background of experience which enables them to make a rough 
estimate. To help you acquire a little experience with correla¬ 
tions of various sizes, Figures 10-2 to 10-7 have been drawn up. 

Look first at Figure 10-2 in which there are four distributions 
with correlation coefficients of 0, .33, .75, and .92. These are drawn 
from artificial data, primarily to show the positions of the two 
regression lines which are identical with the lines of the means 
when r = 0 and draw closer and closer together as r increases. 

Figure 10-3 shows the joint distribution of language score and 
intelligence quotient for the 109 cases in Appendix Table IX. 


Language Score 



Fig. 10-3 Language Score (X) and Intelligence Quotient (Y). 
Bivariate Frequency Distribution with Two Lines of Regression Drawn, 
r = .583 X = 22.0 t x = 6.51 

Y = 74.9 + 1.60X 7 = 110.1 Sy = 17.83 

X = - 1.45 + .213 Y 

Circles mark means of horizontal rows. 

Crosses mark means of vertical columns. 
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„ f « To see more clearly the relation of the regression 

'Tr.lT- noT ,,d ». —« T ~> "Wf 

represent the means of the various You \YiU now 

gge SdS'howtSrTsLrUne straightens out the irregularities 

of the set of observed column means. = 2 2.0, 

Now, with a blue pend, rows , and the 

the small circles which ^repr ^ t ^ ^ notice that 0 f the 

rrs“i” «»;, •»«»r"®'* v i ? i ° 

and the one to predict X is theneare pupi | s in 

t .K XXX £ ,Q ~ 



. , ... r = _ .72. Experience tells you that 

negative and fairly hig , ’ s \ ng \ e school grade the older 

this is reasonable, ““j 1 t ) ian the younger ones. If a 

st ?jzs:2& — cd ' the corrclation a6C 

with IQ would be nearly zero. 
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Age to Nearest Month 

0 * 0*00 - 


49-51 1 I II I I 

46-48 _2_ 

43-45 _I_ 5_2_ 

40-42 _l_ 4_ _L 2 

37-39 _2__ 

34-36 _I_ 3_j_ 

31-33__2 2 

28-30 115 3 3 


25-27 _ 
22-24 _ 
19-21 
16-18 _ 
13-15 _ 
10-12 


7-9 

L 


I_ 5_ 2 

14 13 


2 2 I 


L!LL1 
1 Ai.1 

j_ 2__ 

j_ j_ 4__ 

_ 2__2 

i i 


2 4 21 31 26 8 6 5 1 

f=—.37 


2111 109 


Fig. 10-5 Age and Reading 
Speed. Bivariate Frequency 
Distribution for 109 Fourth- 
grade Pupils. 


Fig. 10-6 Arithmetic Reasoning and 
Arithmetic Computation. Bivariate 
Frequency Distribution for 109 
Fourth-grade Pupils. 


Arithmetic Reasoning 

! 2 3 4 5 6 7 S 9 10 II 12 
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EXERCISE 

Fieure 10-5 shows a negative correlation (- .37) between 
F ^ ond i n! r srieed for the 109 fourth-grade pupils m Ap- 

age and readmg sp d ^ ghows a low relationship 

pendix Table IA, ana b arithmetic computa- 

(r = .27) between arithmetic reason g = between 

- i—. - •« 

course in statistics. 



w . p FiEure 10 -7 to obtain answers to the following questions: 
Examine Tigure ^ ^ mldtcrm test, did either one 

1 . Of the two persons scoring es 

make the lowest score on the hnat Qn ^ midterm and the 

2. Compare the 8 persons with hig many per80ns wcre in both 

9 with highest scores on the hnai. 

grovips? i: - ouestions lead you to believe that, 

3. Do your answers t0 the at the extremes on one variable are 

almost certain to be equally extrenve onhighest on the 

*• sss v —-« 
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5 . Of the 30 persons who scored below 48 on the midterm, how many 

scored 48 or above on the final? , 

6 . Of the 44 who scored above 50 on the final, how many scored 50 or 

below on the midterm? 


Reliability of Measurement. A very important use of the 
correlation coefficient is to evaluate the reliability of measure¬ 
ments. Suppose a physical education teacher, wanting to measure 
the physical fitness of a group of boys at the beginning of a term, 
records the number of push ups each boy can do. How dependable 
is such a single record? If exactly the same task is repeated a 
week later, there will be slight differences m individual per¬ 
formance. The correlation coefficient between the two records 
is called a reliability coefficient and in this situation indicates the 
similarity of performance of individuals on the repetition of a 
task If this reliability coefficient appears to be undesirably 
low, the teacher may decide to use the average of several scores 
made on different days rather than a single day’s score. 

If the task is an intellectual or psychological one, repetition 
of precisely the same task will seldom serve a useful purpose. 
Two applications of precisely the same history test would be 
likely to yield an almost perfect correlation because each person 
would miss nearly the same questions on both occasions, even 
though the scores were not reliable indications of the students 
knowledge of the subject. In such situations it is customary to 
develop two equivalent forms of the test and to correlate scores 
on the two forms. Each form contains only a sample out of all 
the many questions which might be asked, and so is an imperfect 
measuring instrument. The correlation between scores on the 
two forms gives information about the similarity of those forms 
and measures the reliability of the test. If the two forms are ad¬ 
ministered on the same day, variation in the physical or emotional 
state of the pupils has little effect on the outcome and so this 
reliability coefficient is really descriptive of the test. If the two 
forms are administered on different days, the correlation between 
the scores on the two forms is affected both by the differences 
between the items in the two forms and by day-to-day changes 

in the persons who take the test. 

Sometimes only one form is available and the items are 


divided into two half tests called “split halves.” The correlation 


between scores on the two halves is, of course, lower than the 


correlation between two complete test forms, and it is the latter 
which is meant by “the reliability of the test.” The reliability 
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of the entire test can, however, be estimated from the correlation 
between the half tests. 

Let R = correlation between two comparable test forms and 
r = correlation between two half tests. 


R = 


2 r 


(10.13) Then " “ 1 + r 

Tfci* formula is known as the Spearman-Brown Prophecy formula, 
because it was published by Carl Spearman and by William Brown, 

• nnnprs which appeared simultaneously. 

The P reUability coefficient is very easily affected by the range 
of scores A reliability coefficient computed for pupils in grades 
tZ to nine would be considerably higher than one computed 

for pupils in a single grade only. 

EXERCISE 10.3 

—SHhr sssr 

correlation be expected to be between the two 20-item tests if g.ven 

to 120 pupils instead of 60? of a te3t has been found 

2. Suppose the correlation be reliability coefficient 

to be .45 and the test Efficient might he hope to 

to serve his purposes. to make the test twice as long? 

be achieved if «thened the test 

by adding poorer items than those in the original test 

v—v. a «—“ ;S».” 

on one variable from scores on be used t0 predict 

examinations, or high-sc oo gra » ^ made t0 predict 

performance in college A e environment Xhe accuracy 

delinquency from factors i ma de is called the 

with which a prediction of this prediction . A va - 

mlidity of a measure in m 8 exp] . esscd as a correlation 

lidity, or validity coefficient, making the prediction 

coefficient between the measure used in making 

and the measure which is pre icte . re ii a bility is an evalua- 

Validity differs from reliability in y ^ ^ ^ 

tion of the consistency in rep “ )n of the relationship between 
whereas validity is an eval 
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measures on different variables. Reliability is an inherent char¬ 
acteristic of a measure, but validity is a relationship between 
two measures. Thus in reports on a mental test, the reliability 
will be reported as an aspect of the tests, but validities are re¬ 
ported separately for each of the variables the test may be used 
to predict. Thus a test in mathematics might have a validity 
of .50 in predicting success in engineering school and a validity 

near zero in predicting success as a salesman. 

Validity itself is influenced by reliability and a test with 
low reliability will necessarily have low validity in all its pre¬ 
dictions. 

Personnel Selection. A very important application of the 
correlation coefficient is the standardization and evaluation of pro¬ 
cedures used in selection of persons for employment or for admis¬ 
sion to school. The selection procedure provides a measure by 
which subsequent performance on the job or in the school can 

be estimated. 

The user of the test has two problems. He must select a 
score, or standard, such that all candidates who attain or surpass 
this score are selected and the remaining candidates are rejected. 
He must also check to ascertain whether the test is actually' 
effective in selecting the more desirable candidates and in re¬ 
jecting the less desirable. 

The solution of both of these problems is accomplished by 
relating the scores of the selection test to those of a measure of 
achievement in the occupation or school for which applicants are 
selected. This new measure is commonly called the criterion , 
because it is the criterion against w T hich the selection procedure 
is evaluated. Thus the selection procedure might be an entrance 
examination to a college and the criterion might be college-grades. 
Another example is provided by a test which leads to employment 
in a position and ratings of job performances given by supervisors. 

The correlation between a selection test and a related criterion 
is the validity of the test, or the validity coefficient, because this 
correlation measures the validity, or correctness, of the test in 
predicting criterion scores. 

Table 10.2 has been prepared to demonstrate the effectiveness 
of several validity coefficients in a selection procedure. The 
table consists of six subtables, each of which represents a dis¬ 
tribution of 100 individuals according to several levels of the 
test score and of the criterion. Consider, for example, the first 
subtable at the left. Suppose that the standard for this group 
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has been set to select the higher scoring half of the candidate 
croup Of the 50 so selected, 17 are in the top third of the cri¬ 
terion* croup 16 are in the middle third, and 17 are in the lowest 
third. Ve notice also that the rejected candidates have exactly 
the same distribution according to the criterion. We see, 
fore Tat when the validity coefficient is zero, choice of the higher 
Soup on S score provides no advantage. In fact, one is ,ust 

Effectiveness of Selection Procedures for Three Different 

Validity Coefficients 

100 candidates 


TABLE 10.2 


Number of candidates with test score 


in 


Validity 

coefficient 


r = 0 


Criterion 

score 


Lower 

half 


Upper 
half * 


Lowest Middle 
third third 


r = .50 


r = .80 


Top third 
Middle third 
Lowest third 

Top third 
Middle third 
Lowest third 

Top third 
Middle third 
Lowest third 


17 

16 

17 

9 

16 

25 

4 

16 

30 


17 

16 

17 

25 

16 

9 

30 

16 

4 


12 

10 

12 

5 

10 

19 

1 

8 

25 


10 

12 

10 

10 

12 

10 

8 

16 

8 


Top 
third t 

12 
10 
12 

19 
10 

5 

25 
8 

1 


* Upper half of candidates selected 

t Upper third of candidates selected 

When the validity is O.oO, 
as well off Without using the^ test test score , we find that 

if we choose the upper half of iU e g F» but only g are 

i* *> -: “f■»"««*'»'*»• 

is .80. Table 10 2 it is assumed that we 

In the right-hand colum ()[1 the bas is of test score. 

select only the top third of t g P that the test is 

The first subtable in this column shows at ^ ^ s(jb _ 

not helpful in selection when v'Id subje( , ls are in the upper 

table shows that 19 of the 3 , ( . ent of the selected 

third of the criterion wcre thc up p e r third when 

group, whereas only o0 p 
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half were chosen. Thus a choice of upper third rather than upper 
half provides a distinct improvement in selection when the va¬ 
lidity coefficient is 0.50. A similar gain is demonstrated for the 

validity coefficient of 0.80. . 

The conclusion from this analysis of Table 10.2 is that a test 

is increasingly useful in selection as its validity coefficient in¬ 
creases. If the validity coefficient is positive, selection is improved 

if the standard for selection is set higher. _ 

The situation is analogous when the validity coefficient is 

negative, but in that case we choose from the lower level of test 

score rather than from the upper level. 

Phi Coefficient. Sometimes the scatter diagram consists of 

only two rows and two columns. If the frequencies in the four 
cells of such a table are indicated by the 0 1 

letters a , 6, c, and d , placed as in the 1 
adjacent sketch, the correlation coeffi- Q 
cient can be obtained by a very con¬ 
venient short cut and the result is usually b + d a + c 
called 4> (phi) to inform the reader that it 
was obtained from a four-fold table. 


.b 

a 

d 

c 


o + 6 
d + c 


(10.14) 


ad - be _ 

* = V(a + b) (d + c) (b + d) (a + c) 


Suppose that out of 60 men, 23 approve a particular proposition, 
while out of 80 women, 27 approve it. Is there a relation between 
sex and approval of the proposition? 

Men Women 


37(27) - 53(23) 

* V3b(90H60) (80) 


-.047 


Approve 

Disapprove 


23 

27 

37 

53 


60 80 


50 

90 

[140 


The relationship is practically zero, certainly negligible. 

Correlation among Ranks. Sometimes where there is no satis¬ 
factory device for scoring a trait, individuals can be placed in a 
rank order in respect to the degree of the trait they exhibit. In 
such a situation it is possible to say for any two individuals which 
one is higher on the scale for the trait, but not possible to say 
how much higher. The reader will readily think of many situa¬ 
tions of this kind, as, for example, the attempt to place in order 
of merit the performances of contestants for an award in some 
contest or to rank persons on some intangible such as “friendii- 
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ness” or “conscientiousness.” If the group is large, even deter¬ 
mining the order of individuals becomes difficult, so a correlation 
coefficient based on rank order is obviously most useful ivhen 

"“The c or r el at ion ^between two sets of ranks is called rank order 
correlation, or Spearman’s ran* order coeifkienMbecause it was 
originally proposed by Carl Spearman) or sometimes rho (be 
caiSe he denoted it by Greek letter p). The formula 


(10.15) 


R = 1 - 


6 Sd 2 

N{N 2 ~ T) 


where N is the number of individuals ranked and d is the dif- 

f0r S U upposel2 cakes submitted in a competition at a country fair 
have been ranked by two iudges with results as shown in Table 

10.3. Then 


6(40) 

R - l ion 44 - hi 


123 on 
= TO = 


TABLE 10.3 


Computation of Coefficient of Correlation among 

Assigned to 12 Cakes 



Rank assigned by 



Ranks 
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Now let the ranks assigned by Judge I be denoted X and those 
assigned by Judge II be denoted T. Then 


2X = 78 
2X 2 = 650 


ST = 78 
ST 2 = 650 



S XY = 630 


7R 2 

S y* = 650 - = 143 

Z* - 630 - sm - 123 


, 123 _ 123 _ sfi 

and r = V (143) (143) 143 

This computation illustrates the fact that the rank order 
coefficient and the product moment coefficient applied to the 
ranks are identical. However, if scores are transformed to ranks, 
the product moment correlation of the ranks (which is the rank 
order coefficient) is almost certain to be different from the product 
moment coefficient of the original scores. 



The outcome of a correlational study is often difficult to interpret 
because the correlation between two variables not only reflects 
fhe strength of the intrinsic relation between those variables but 
may also 8 be affected by other circumstances, such as the re- 

-,.h a. v„i“ “i 

ol eoefbeient. you obUin il you ... ", 

as rs;-r-r.‘: 

correlation between height >nd ..eight •!« hove 

identically the 

sir:;, r jssi rex •“ 

hU g’Td^Hli"^ "If »lu« o, one variable ha. 

no effect on either r or 6 but will affect the constants m the 

regression equations. 
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Statements (1) and (2) call attention to the fact that, in 
computing the sums of products and of squares which are used 
in obtaining a correlation coefficient, no use whatever is made 
of the scales of the variables. However, the scales are needed for 
obtaining the regression equations since the step interval is re¬ 
quired for finding the regression coefficient, and both means are 
required for finding the constant term in the equation. 

Effect of Grouping Scores. Grouping scores into class intervals 
will not greatly affect the value of r or b provided the intervals 
are not made very large. While the most accurate results are 
achieved by use of ungrouped data, computations made by sub¬ 
division of the scales into 10 or 15 intervals give satisfactory 
results for many purposes. The extreme procedure of subdividing 
the scales into two or three intervals and then computing the 
correlation and regression coefficients should be avoided since 
the distortion would then be considerable. 

Variable Errors of Measurement. By now you are aware that 
nothing can be measured with complete precision. Every good 
research worker eliminates all possible bias from his measures 
and reduces accidental errors of observation and measurement 
to a minimum, but he can never completely free his observations 
from accidental error. If these variable errors which remain are 
purely chance errors, they are as likely to be positive as to be 
negative, and to have only a negligible effect on measures of 
central tendency. They tend to increase measures of variability, 
and they tend to decrease measures of relationship. This is an 
important matter. It means that an observed correlation is al¬ 
most always lower than the correlation between true measures 
of the same traits would be. It means that if measurement is 
made with unreliable instruments, the observed correlation co¬ 
efficient may be very low even when the traits concerned are 
closely related. 

The reduction in the size of r because of variable errors of 
measurement is called attenuation. To illustrate its effect, an 
experiment was performed in which purely random errors were 
attached to the language scores and the intelligence quotients 
of the 109 subjects in Appendix Table IX. A variable error 
was attached to each score in this way. Two decks of cards were 
shuffled together and a card was drawn. The number on the 
first card drawn was taken as the error to be added to the score 
of the first child, the number on the second card drawn as the 
error for the second child, etc. Red cards were considered to 
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indicate positive errors; black cards, negative. After every tenth 
drawing the cards were reshuffled. For case 1, the ongmal 
language score was 17, the first card drawn was a six of hearts, 
and so the score plus error is 17 + 6 = 23. (It must not e sup 
posed that the original scores were free from measurernent erro .) 

Results of computing from the original scores and from 
scores with random errors attached were as follows: 


Original 

Scores 

Scores 

with Errors 

21.98 

22.08 

110.04 

110.02 

6.5 

10.3 

17.5 

19.6 

.59 

.20 


Mean language score 
Mean IQ 

Standard deviation, language scores 
Standard deviation, IQ 
Correlation coefficient 

The means have been changed so little by the variable errors 

, 1 n 1 .n In ho carried to four or five places before any 

that results ia ‘ has been very slightly increased, 

difference a P pea ^‘ d d Bot h standard deviations have 

grE Effect' of C Number of Cases. The number of cases on which 

Of one vanable (or both) 1 e on a A straight Une is called 

a row or a column is straight is called nonlinear 

linear or rectilinear- A ^ on age for almost any physical 

“-i-«>*“ - •>* — m ** 

Several such curved regression lines 
have been shown in Figure 9-8 on 
page 138. When a measure of over¬ 
head expense is related to the number 
of individuals involved, the scatter 
diagram often resembles the adja¬ 
cent sketch. In some psychological 
studies there are variables for which 
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a rating at either extreme tends to be associated with low per¬ 
formance or poor adjustment, and a rating in the middle range 
to be associated with better achievement or more healthy adjust¬ 
ment. An extreme situation is represented in the sketch below 
where column means are represented by small crosses and row 
means by small circles. A curved regression line fitted to the 
small crosses would represent the data rather well, so that it would 
be possible to make a fairly good estimate of Y from X } but a 
line fitted to the row means would have little meaning. 






0 

O 

0 










0 0 

0 0 

.\° 

0 

X # 

0 

0 

0 







• • 

.X. 

0 

•X* 

0 

<2* 

0 

.X. 

0 0 
.X. 






• 

0 

• 0 

•X# 

0 0 

1 °.- 


0 0 

0 

• 




• 

• 

• 

X 

0 


( 

) 



0 

0 0 

• 


• 

X 

• 

0 



o 



.X. 

0 

0 


X 

0 

• 

0 




c 

) 


0 

0 • 

X 

0 

0 

• 

• 

• 




0 





0 

0 

0 


Since the correlation coefficient is based on the assumption of 
linearity of regression, it would be practically meaningless in 
data such as are sketched here; in fact, it could be nearly zero 
in such data, even if there were a decided relation between the 
variables. There is a measure of relation which is applicable in 
such situations, called the correlation ratio. It is equal to r when 
regression is precisely linear, and greater than r when regression 
is not linear. The computation of the correlation ratio will not 
be described in this book. 

Selection of Cases. The effect on correlation of some particular 
selectivity in the data can be very perplexing. For example, if 
you were asked what is the correlation between age and intelli¬ 
gence quotient, you might say promptly that it is zero. Yet if 
you should compute that correlation for the 109 fourth-grade 
pupils of Appendix Table IX, you would discover that r = - .72, 
as in Figure 10^4. Why? Because of selectivity of subjects. It 
has already been noted that within any single grade (here the 


165 


eliminating cases near the mean 

, th „ voune er children tend to be brighter than the older 

fourth) tj g tive correlation between age and a measure 

ritowC... H *“ «“"» “ ’ ‘ ya,m " 1 

- =—, -r 

*T, * may .... b. 

Some of m •«, U„, e„ ««r 

will be discussed in the next our set^on a researC h 

Eliminating Cases nea cases an d then eliminates 

worker of cases near the center of the distri- 

L r uton° n if he computes a coefficient of correlation from the two 


Language Score 
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extreme groups, he is likely to get a value numerically larger 
than if he had used the full distribution. 

For example, consider Figure 11-1 in which, for all 109 cases, 
the correlation between intelligence quotient and language score 
has been found to be r s .58. If you should eliminate the 48 cases 
with language score between 19 and 27, leaving the distribution 
of Figure 11-1, the correlation computed for the remaining 
61 cases would be r = .69. It is possible to make a good estimate 
of the correlation in an entire distribution from certain statistics 
computed from cases at the two ends of that distribution, but 

special tables are required for that purpose. 1 

Correlation in a Composite Group when the Subgroups Have 
Unequal Means. Sometimes two groups with widely separated 
means are thrown together to form one bivariate distribution. 
The correlations obtained from a composite group of this sort 
are spurious and meaningless. 

This phenomenon is well illustrated by data gathered by 
Simpson, 2 who studied two groups: A, consisting of professors and 
advanced students in Columbia University; and B, consisting of 
unemployed men in New York missions, none of whom had ever 
held a position demanding a high grade of intelligence. Among 
the tests he administered were these: 

h — Auditory memory for words 

Xi — Auditory memory for a connected passage 

x 3 — Visual recognition of forms 

The following correlations were obtained: 



Group A 

Group B 

Combined groups 

rn 

.23 

.73 

.82 

ru 

.37 

.15 

.71 

TiS 

.14 

.06 

.66 


If two or more groups have the same means on both variables, 
the correlation obtained from the combined groups is not dis¬ 
torted in any serious fashion. When the groups have different 
means on one or both variables, the effect of combining them 
may be to produce a correlation coefficient larger than the correla- 


1 These tables were computed and privately distributed by John C. Flanagan. 
They have been reproduced by permission in R. L. Thorndike, Personal Seleclion, Test 
and Measurement Techniques, John Wiley and Sons, 1949, and in H. M. Walker and 
J. Lev, Statistical Inference, Henry Holt, 1953. 

* Correlations of Mental Abilities, New York, Teachers College Bureau of of Publica¬ 
tions, 1912. 



167 


CORRELATION IN A COMPOSITE GROUP 

tinn in the separate groups, as in the example just cited or 
smaller, ^or even different in sign. Some such effects are sche- 

matically suggested in Figure 11 2 



... . How the Value of the Correlate wer 
Fig. 11-2 Sketches to Illustrate Groups with Unequal Means. 

Affected by the Combinat.on of Two M • grQup( |f |f high and positive. 

A. In each group, r » group , i» i, negative. 


A. In each group, r s gr ~oup, it is negative. 

B C £ rat buHow , composite group, it U P— - 


D . In^each group, r h negative. In composite group, I, h near zero. 

I _— nn O If V' 


D. In each group, r is , . 

Two groups with ugusl -““in“ wwTU» 

order “ ob ““ * j™e"wo v.ri.hle. wltbln * b » *" d 

wm no the "affected by the differences between the groups, 
do this formula (11.1) may be used 

(11.1) r 


= (2X0(27.) _ (H.Mliy _ L 2X '^ — |) 

sx,y, + xx.r, + xx, y, 


I ,, JTTxx.r _(ixji 3 w' V 2 r,+ 2 7;+ 2 y<— W t -sr i 

yzxi+zxi + ix, Jfi— N t A| 
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The formula as presented here is for a composite of three sub¬ 
groups, but can easily be adapted to a different number of groups. 
Such expressions as 2X,, ST*, etc., indicate summation for the 
subgroup only. 

Often the presence of a single extreme case presents the re¬ 
search worker with a dilemma. Thus Rowell found a 14-year-old 
child with IQ of 45 in a third-grade class. Either to include him 
or to exclude him from her computations would obviously mis¬ 
represent the situation. Under such circumstances the best pro¬ 
cedure is probably to state the situation so the reader can under¬ 
stand it and show how the extreme case affects results. Rowell 
(unpublished study) found correlations as follows: 


Traits 


With extreme 
case 


Without extreme 
case 


CA and IQ -.500 

CA and MA —.171 

MA and IQ -.917 


-.430 

-.078 

-.919 


Effect of Range. One of the chief troubles in using the corre¬ 
lation coefficient is the effect of the range of scores on the value 
of this coefficient. If the group is restricted in range on one of the 
variables, it is also so restricted on the others, and the correlation 
is greatly reduced in value. If the group has a great range of 
values on one variable, and if nonzero correlation exists, then the 
range on the other variable is relatively great, and the correlation 
is large in value. 

This effect is so disconcerting to some statisticians that they 
prefer not to use the coefficient of correlation at all. Instead, 
they use the regression formula and the related standard error of 
estimate as a description of the relation between two variables. 
The correlation coefficient is, nevertheless, generally used as a 
measure of relationship. The precaution must be taken, however, 
to keep the range of scores in mind when interpreting a correlation 
coefficient. 

As an example of the effect of changing the range of scores, 
consider the correlation between IQ and language for the children 
listed in Figure 10-3 on page 150. For the entire group this 
correlation is .58, but when the IQ range is restricted by con¬ 
sidering only children with IQ of 126 or higher from this group, 
the correlation is reduced to .20. When children with IQ less 
than !)1 or IQ greater than 125 are excluded, the correlation is 
reduced to .24. See Exercise 11.1. 
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effect of range 

The matter of range in test scores is very important in evaluat¬ 
ing validity and reliability coefficients. Such a coefficient based 
on a group which is highly homogeneous in ability is likely■to be 
low fOT that reason alone, in comparison with such a coefficient 

for ft frrouD very diverse in ability. , u 

1 problem of restriction of range arises when only those who 

achieve high test scores are selected for employment and the later 

iris." 

s'ids - —>■* '■ 

J? - •"tbs;.Tz,=s; *xsx «- 

the selected group t ^ ^ selected gr0 «p, the standard 

estimated by the formula 

r,y*S, _ 


( 11 . 2 ) 


Rtv = 


y/ rlySl + sl(l — T xv) 


EXERCISE 11 1 

alucs of r, b V x, biy, s t , ant l ^ 
m the data of Figure 10-3 if ce 
L tcd and computations were in 

Cases retained N 


. ^ 1U'J U.o 

(a) Entire group 

(b) Language score between 19 and ^ 2) 

27 inclusive ' 

(c) IQ between 91 and 125 inclusive 7. 5.8 

(d) Language score below 19 • > 

(e) Language score above -7 

(f) Language score below 19 or g } 

above 27 


md s u 

listec 

1 below 

which would be 

* 

3 if certain 

parts of 

the 

distribution 

,'ere made from the 

cases indie 

a ted: 

N 

s* 

s u 

r 

byw 

b IU 

109 

6.5 

17.8 

.58 

1.00 

.21 

48 

2.4 

14.4 

.10 

.93 

.020 

75 

5.8 

9.0 

.24 

.40 

.15 

30 

2.8 

10.0 

.39 

2.28 

.07 

25 

2.0 

13.9 

.13 

.70 

.025 

01 

8.4 

13.9 

.09 

1.05 

.29 
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Cases retained 

(g) IQ below 91 

(h) IQ above 125 

(i) IQ below 91 or above 125 


regression coefficients 


N 

8x 

8y 

r 

bys 

K 

15 

4.5 

7.9 

.26 

.45 

.15 

19 

4.0 

6.8 

.20 

.34 

.11 

34 

7.8 

28.9 

.85 

3.14 

.23 


2 In situation (b), the extreme values of x were removed. What did 
this removal do to s*? s y ? r? bj bj Which was affected more, a, or *7 

3 In situation (c), the extreme values of Y were removed. What did 
’ this removal do to a,? a v ? r? W bj Which was affected more, a. or a,? 

4 In situation (f), the middle of the distribution of x was removed. What 
did this do to 8x? a y ? r? 6*7 6xy? Which was affected more, a* or a v ? 

5 In situation (i), the middle of the distribution of Y was removed. 
What did this removal do to a,? a y ? r? V 6*? Which was affected 

more, a* or a v ? 


Variation in a Third Trait. Sometimes part of the difficulty 
in interpreting a coefficient of correlation comes from the fact 
that both of the correlated variables appear to be related to a 
third variable in such a way that the intrinsic relation between 
the two correlated variables is either exaggerated or suppressed 
by variation in the third. For example, in 1877, long before the 
coefficient of correlation was known, an American named Bow- 
ditch made a study of the physical characteristics of 24,500 Boston 
school children, and prepared what today we would call regression 
lines of weight on height, weight on age, and height on age. 8 
He was clearly looking for some way to express the relation between 
height and weight, and he nearly discovered the idea of correla¬ 
tion. However, he became confused by the possible influence of 
age on that relation and so failed to reach a satisfactory measure 
of relationship. Years later Ethel Elderton made body measure¬ 
ments of a large number of school children in Scotland. For girls 
ranging in age from 5 to 18, the correlation of height and weight 
was .91, but when the total group was broken up into smaller 
groups with an age range of only one year, the correlation was 
much lower. For children whose ages span a wide range, the 
correlation is affected by the fact that older children are, on the 
average, both taller and heavier than younger children, so the 
correlation r Kw = .91 actually involves age as well as height and 
weight. When age range is rigidly restricted, the correlation 
coefficient between height and weight averages in the low .80’s, 

* Henry Pickering Bowditch, "The Growth of Children,” Report of Board of Health 
of Massachusetts (1877). Reprinted in Papers on Anthropometry, Boston, 1894. 
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variation in a third trait 
though, of course, it would not be the same for infants as for older 

'^The effect of a third variable, which we may call *,°nthe corre- 
1 tion between .and, can be removed by —— 

lartialmr^elation of x and y, when z is held constant and denoted 
r (read “rxy point z"). What this formula actually does is 
to describe the "correlation of two residuals one the residual 
x _ jf where X is predicted from Z, and the other Y - Y, when 

is predicted from Z. The formula is 


(11.3) 


Tzy.z 


T Z y TxzTyi 


V(1 - r|,) (1 — rid 


Thus if for the three variables height, weight, and age, 

Tkv = .907, r ha = .70, and r va = .70 


the partial correlation is 


Thw.a 


.907 - ( .70) (.70) 

-"^49 


= .82 


onnmrh In most situations the per- 

The computations are * y ^ inter ested in the correlation 

"'erg nTTZero o^er correlation”) or r,„, (called a “first 
order partial”)• For that decision, the only guide is very careful 
thinking about the nature of the data 

TO U.U.WO .h. W »'**•<> *. 

some data gathered y ^ Qf 15 - census tracts in Baltimore 

juvenile delinquency. ure 0 f juvenile delinquency (y), 

(TV = 155), he obtained a^meas 1 of overcrow ding in 

of median education e idents w ho were non white (x 3 ), 

housing (ft), of tvne^ by their occupants (ft). The 

folwLg e coTe*ations were obtained among these variables: 


Xi 


x-i %3 x * y 

-.71 -41 .39 -.51 

— .69 -.72 .73 

_ -.76 .70 

— -.80 


X\ = median education 
x, = overcrowding 
Xi = percent nonwhite 

x 4 = percent homes owner occupied _ 

y = juvenile delinquency 

. Bernard Lander, tW an ~nd,n„ a/ 

University Press, 1954. 
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The correlations listed in the y column are all fairly large, but 
each requires interpretation in terms of the other variables. 
Suppose we had only the correlation r y3 = .70 and argued that 
this high correlation indicated that the presence of nonwhites 
is the chief cause of juvenile delinquency. We should be over¬ 
looking the fact that a census tract with a large percent of non¬ 
white residents is likely to be also a tract in which median educa¬ 
tion is low (r, 3 = - .41), overcrowding is prevalent (r 23 = .69), 
and relatively few homes are owner occupied (r 43 = - .76). If 
we should “partial out” one of these three other variables, we 
would obtain much smaller correlations: 

r til = - 70 -(" 51)( -’ 41) -.63 
' V (.74) (.83) 

.70-(.73) (.69) Jn 

r ” 3 ' 2 = V(.47)(^T " 

.70 - (- .80) (- .76) 

Tyri V (.36) (.42) 

Here any interpretation of correlation values as indicative of 
cause is obviously fallacious. If, however, the goal is to select 
census tracts in which delinquency is very likely to be high and 
other tracts in which it is likely to be low, the data can be used 
effectively. Selection on the basis of x 4 alone would serve very 
well. It is possible to obtain what is known as a multiple regression 
equation to estimate y from all four of the other variables together, 
but this technique is outside the scope of this text. For the Lander 
data, the multiple regression equation to predict the juvenile 
delinquency measure for a Baltimore census tract would be 

y = - .134xi + .163x 2 + .127x 3 — .534x 4 

when y, x>, x 3 , and x 4 are deviations from the means of the 
respective variables. The correlation between the criterion vari¬ 
able y and the estimate of y obtained by the multiple regression 
equation is known as a multiple correlation or multiple r. In the 
Lander data this multiple r is .84 whereas r y4 = .80, and so the 
single variable x 4 provides nearly as good a selective device as 
the combination of all four variables. 

EXEKCISE 11.2 

1. From the Lander data quoted above, confirm that r !4 = .39 while 
r u ,2 = - .24. Note, therefore, that a partial correlation may even be 
different in sign from the corresponding zero order coefficient. 
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EXERCISE 

effect of variation in overcrowding is removed (r,» - W- 

, ppn that the correlation between intelligence quotient 
3. We have already seen . t ^^fourth-grade children was .58. This 
and language score for the 10J to g younger children 

correlation, however, is parUy due to the. fact thatjouj^ ^ 

in the fourth grade ten. *° ch ffdren. Examine the subgroups 

ltedTn Ce aget Figure 11-3. Verify the correlations stated for the 

subgroups. 

Language Score 


147-153 

140-146 

133-139 

126-132 


119-125 


* 112-118 
9 

a 

• 105-111 


= 98-10J 
c 

91-9 


84-9 


77-0 


70-7 





BSnnm ffi aaw 

OEEiSifiHHnOB 


mlaBiP 


mHHI 


63-69 | A | A. 



19 1/ »' * 


Flg . , 1-3 'U,!,. SCO,. ~ ... Dim—. - 0— 

of Varying Age. N = 27 r = .50 


X Children aged 94-108 months 

O Children aged 109-113 man h* 

# Children aged 11 4-1 18 man h, 

A Children aged 119-156 months 

For entire group N = 109 and r - .58 


N = 27 
N = 31 
N = 26 
N = 25 


r = .50 
r = .42 
r = .35 
r =- .56 
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4 . For the 109 subjects in Appendix Table IX what would be the 
partial correlation between language score (x,) and intelligence co¬ 
efficient (x 2 ) with the influence of age (x 3 ) removed if the zero order 

correlations are as follows: 

ria = .583, ru = — .396, ru = — .723 


5. In a study of 65 delinquent boys at the New York State Training School 
at Warwick, Trent obtained measures on the following variables: 

X\ Anxiety = score on questionnaire items related to manifest anxiety 
x 2 L score = score on questionnaire items related to tendency to falsify 
x z Choice score = number of other boys who indicated a preference 
for this boy as a friend 

xi Rejection score = number of other boys who indicated dislike of 
this boy 

Xs Length of stay in the institution 
x 6 Intelligence coefficient 
x 1 Age in months 


The resulting zero order correlations were: 




r, Anxiety 
e 2 L scale 
e 3 Choice 
E( Rejection 
r 5 Length of stay 

IQ 

x 7 Age 


x 2 

x 3 

X\ 

Xi 

x« 

*7 

,094 

-.036 

.114 

.082 • 

-.131 

.107 

— 

.079 

-.172 

.090 

-.312 

.751 


— 

.749 

.147 

-.111 

.148 



— 

-.066 

-.206 

-.040 




— 

-.182 

.197 





— 

-.014 

will 

be discussed in 

Chapter 

16, we may 

hen a 

correlation based 

on 65 subjects 

is nu- 

as .25 

(that is, 

either r - 

- .25 or : 

r + .25) 

will it 


l)C cippi upi III LvJ IU iiout vwv. uuMjwtw -- 

similar subjects. Such a correlation will be termed significant, 
indicating that it has a general import, telling us something about 
the relationship for individuals other than the G5 subjects used 


in this study. If the size of this sample had been different, the 
dividing point between significant and nonsignificant values would 
not have been .25. 

Look over the list of correlations to decide which ones meet 
this criterion. 

(b) Examine r 27 = .751. Which of the following statements seems to 
you to be justified? What is wrong with each of the others? 

(1) In general, delinquent boys have a tendency to falsify more as 
they grow older. 

(2) In general, delinquent boys have a tendency to falsify less as 
they grow older. 
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EXERCISE 

(3) In general, all boys have a tendency to falsify less as they grow 

(4) In general, older boys in an institution for delinquents tend to 
falsify more than younger boys. 

(5) In general, older boys in an institution for delinquents tend to 
falsify less than younger boys. 

(6) If the oldest and the youngest boys in the institution are given 
’ the L scale, the oldest boy will obtain the higher score. 

(7) Among these 65 boys, the older ones made higher scores on th 
L scale, but the correlation was too small to permit generaliza¬ 
tion be’vond the 65 subjects examined. 

(c) Examine Z Which of the following statements seems to you to 

be justified? What is wrong with the others. 

(1) Children who are brighter tend to falsify more than those who 

(o\ Having a low IQ is the reason delinquent boys falsify. 

3 I 1^general, delinquent boys with a high IQ ^d to h e lower 
^ ’ I scores (i e to falsify less) than those with a low IQ. 

(4) For these 65 boy* there was a negative relation between .ntelh- 
W gence and L -ore, but the correlation was too low to warrant 

generalizing to other similar subjects. ide 

(5) The correlation r 2 . is large enough so that the IQ will pronde 

a good prediction people would 

(d) Examine r M . Di yon expec rejection score were practically 

assume that choice s chow a high negative correlation. 

opposite and sh ° uld h sU ,died here, a boy who is 

Hmvever, in such a i-up^ha^bee ^ ^ ^ „ my be much 

much sought after by h d so may score high on 

both and «, int is made to help you see how inl¬ 
and few rejections. dftta and to think about them, 

portant it is to understan y but does not think about 

and how the person ; numerically correct, 
the data may misint I other vur iables was positively rc- 

(e) “ sy n tive ^cr t c :‘: 

‘them K enough to jusUfy generalization to other similar sub- 
jccts? 

_ . .. fhpi Size of a Correlation Coefficient. This 

Sowar .. - th. a 1. 

that discussion. 
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The Normal Distribution 


I know of scarcely anything so apt to impress the imagination as 
the wonderful form of cosmic order expressed by the “Law of Fre¬ 
quency of Error.” The law would have been personified by the 
Greeks and deified, if they had known it. It reigns with serenity 
and in complete self-effacement amidst the wildest confusion. The 
huger the mob and the greater the apparent anarchy, the more per¬ 
fect is its sway. It is the supreme law of Unreason. Whenever a 
large sample of chaotic elements are taken in hand and marshalled 
in the order of their magnitude, an unsuspected and most beautiful 
form of regularity proves to have been latent all along. 

Sir Francis Galton 

Almost every person trained in any of the sciences knows and uses 
the term “normal curve” but many use it incorrectly or at least 
without understanding. Social scientists sometimes assert that 
when enough cases are observed, every human trait is normally 
distributed. This is not so. The curve is esthetically beautiful 
and mathematically interesting and many people assume that 
mathematicians have proved it to have a universal cogency. Teach¬ 
ers sometimes talk foolishly about “grading on the curve and 
deceive themselves into thinking they are being commendably 
scientific. However, in spite of popularly held misconceptions 
and all too frequent misuse, the normal distribution is a powerful 
statistical concept with extremely valuable practical applications. 

The Normal Curve. Suppose you have measured the heights 
of 50 adult men and from those measures have constructed a histo¬ 
gram with step intervals of 1 inch. Try to imagine in a general way 
how that histogram might look. It would be somewhat irregular 
because the number of cases is small. Almost certainly the fre¬ 
quencies would pile up in the neighborhood of the mean and fall 
off at both ends of the distribution. 
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THE NORMAL CURVE 

Suppr tsrir-“« 

SLTL"i”Xy” U» first distribution would loud to dc 
crease, and tire WatoRrar^would 1*™™ aseTti 10,000 and 

~-■=iiEr STS' “Sir k s 

greatly increased, tne distnoui . g nQW very obvious, 

piling up of frequencies arou t of the distribution. 

and we say in this last histogram are outlined 

Imagine that the reel g a wall at some distance 

in heavy ink and «B r attribution is still represented by a 

from you. The Iteque y thc tops 0 f those rectangles 

series of rectangles but curve Furt hermore for the variable 

may look to you like a sm __ which this 

named - that is, the heights 4 ““ mb le the ideal mathe- 

distribution suggests would stiong > 

matical curve known as thc norma cu with many narrow step 

Fifiur. .2-1 ,ho,.iu..™b.h;M id Tl.U 



tions. 


1S - . r„.ri'it,lr a histogram with narrow sli p 

If, for any continuous vana ^ , - (J . strilmtion for a very 

intervals is drawn to sh J q{ [ he rcctaIlgles will convey the 
large number of cases, tne p 


THE NORMAL DISTRIBUTION 


178 

general impression of a smooth curve. However, that curve will 
not necessarily resemble the normal curve you have seen in Fig¬ 
ure 12-1. Suppose, for example, that the variable is the annual 
dollar income of all families in some fairly large city. Toward 
the lower end of the income scale, the density will be much greater 
than toward the upper end, and the distribution will be consider¬ 
ably skewed, with its mean larger than its median and its median 
larger than its mode. The smooth curve suggested by the tops 
of the rectangles in this histogram would be similarly skewed and 
would not bear much resemblance to the normal curve. 

The Normal Distribution. The normal curve is a mathematical 
construct based on an equation which was first derived as a mathe¬ 
matical exercise without any reference to concrete data. It is an 
astonishing fact that many physical variables have frequency dis¬ 
tributions which bear a striking similarity to this mathematical 
curve. 

The theoretical normal curve never actually touches the base 
line and its range is unlimited, though it comes closer and closer 
to the base line as you go farther out in either direction. For any 
physical variable, such as height, the range must of necessity be 
limited. 

The area of all the rectangles which form a histogram represents 
the total frequency in the distribution. Similarly, the area under 
a normal curve represents the total frequency of a normally dis¬ 
tributed. variable. 

In a histogram, the frequency in any class interval is repre¬ 
sented by the area of a rec¬ 
tangle which has that inter¬ 
val as its base. In a normal 
curve, if one ordinate is 
drawn at any point A on the 
base line and another drawn 
at a second point B, the area of the segment between those ordi¬ 
nates represents the frequency of values in the interval AB. 

In graphing the distribution of a physical variable, it is impos¬ 
sible to make the intervals small enough for the distribution ac¬ 
tually to become a smooth curve. Thus the graph of a set of 
observed data will always be a step curve, or histogram. However, 
if the normal curve appears to approximate closely the. frequency 
distribution of a variable in a set of data, we are inclined to gen¬ 
eralize and to say that variable is normally distributed , or has a 
normal distribution. 
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horizontal line segment. , y- the standard deviation 
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5££|"? r“ diluibutlon fall into the three genera. 

categories which follow: interesting application is in 

1 Its most important ^/““VSmKumber of measures 
connection with generalization ro of a | a rger number of 

on observed individuals o “™ observe d. Much of the information 

individuals that have n education and government de- 

basic to modern science, industry education,^ ^ , g discussed 
rrC^r^pTemlartC, ii will not be elaborated on 

here 2 . Certain types of physical 

which so closely approxun^ i sciences furnish numerous 

tates their analysis, (a) the P . tabulation of the normal 

distributions of this nature- The of Ught ncar the 

distribution appeared in a boo Another early work used 

horizon, published shortly before A q ™ . Qn of lhe pola r star. 

it in relation to the determi surements on human beings 

(b) The distributions of some’ tical ' norma l distribution; 

have a striking «^blan<* tot-heUn ^ . in 

other measurements dp not - Kg hoinoRe neous as to racial origin 

a large, unselected group o • * ^ the distribution of 

and living under fairly similar cmkUt^ ^ (listrihution 

height closely approximatesit p included both sexes, 

would probably not be nor 
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or was made up of children of varying age, or included persons 
born in diverse parts of the globe and reared under different cli¬ 
matic and dietary conditions. The distribution of weight, on the 
other hand, is likely to be very far from normal because weight is 
not so directly dependent on heredity as height is, being partly 
controlled by individual preference. It takes little thought to 
recognize that many measures influenced by social customs have 
distributions very far from the normal form. Consider, for ex¬ 
ample, the distributions, for adults in the United States, of dollar 
income, number of years of education, average caloric intake per 
day, number of days illness in the past year, etc. About the middle 
of the nineteenth century, the great Belgian statistician, Adolph 
Quetelet, made studies of the heights of army recruits and decided 
that they followed the “normal law.” In that period there were 
almost no other large-scale compilations of data available, but 
Quetelet was convinced, by analogy to height, that when such 
became available they too would “obey the normal law” and his 
ideas were widely disseminated. This historical accident may be 
the origin of the current superstition that all variable traits are 
normally distributed. There are statistical tests for normality of 
distribution but these are outside the scope of this book. One 
such test is described on pages 119-123 of the authors’ Statistical 
Inference. 

3. For some variables the true form of distribution is unknown, 
but it is convenient to scale the variable in such a way as to produce 
a normal distribution of scores. Consider intelligence, for example. 
It seems reasonable to assume that an unselected group of children 
of a given age would show a normal distribution of intelligence 
test scores and most of the intelligence tests are scaled in such a 
way as to produce that result. In some situations only categorical 
ratings are available — as, for example, when a teacher assigns 
grades of A, B, C, D, and F. As nothing is known about the form 
of the underlying distribution, it often seems not unreasonable to 
transform these categorical ratings to scores on the assumption 
that the underlying trait has a normal distribution. The procedure 
of “normalizing” will be taken up at the end of this chapter. 

Drawing a Normal Curve. In order to make use of the normal 
distribution, one needs to be able to read values from such tables as 
Tables I and II in the Appendix and to have a clear understanding 
of what the tabular entries mean. An intuitive feeling for the 
shape of the curve is a real help. One of the best ways to develop 
such understanding is to make a drawing of the curve and to ex¬ 
amine its properties. 
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drawing a normal curve 

Look once more at Figure 12-1. If X represent the score value 

of any point on the horizontal scale, then —— would be the 
, 1 mint pvnressed in terms of the mean and 

■rs •zzztszzzzz* «—*.*■» •« ■ 

normal distribution, such a standard score would be denoted 


z = 


X-M 


or 


x 

z = - 

<J 


( 12 . 1 ) 

called . -deed .** 

axis of the curve we are about to d t0 the 

£ EEiSSi==.. i - 

. X — M _ 9 ^ Therefore, it would seem satisfactor> to 
is at z = —- 

iet zitJZTy “of'4* 

scale and mark on it points w uc i > interval between two 

and 3. It will be convenient tohaveU.o mte^ ^ ^ order 

successive points subdm ed d in Xable 12 . 1 . At the 

that you can easily plot the va axis and mark 0 n it the 

point ma ' kcd °',! rCC 20 a 25 30 .35, and .40 as in Figure 12-2. 
Cmay uf; -slale^rent fr'om that in this figure but do not 

make your drawing too small. j lot a po int in the usual 

JZT Iff p —* - — of 2 is 



" 3 , r ‘ (Ar .„ o. Ldsd ,.ccs 0 ls 1. ono l.nlh of .0,01 co 

Fig 12-2 The Normal Curve. V 
under the curve.) 
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TABLE 12.1 Abscissas (z) and Ordinates (y) of the Standard Normal 

Curve 


z 

y 

2 

V 

0 

.399 

1.6 

.111 

.2 

.391 

1.8 

.079 

.4 

.368 

2.0 

.054 

.6 

.333 

2.2 

.036 

.8 

.290 

2.4 

.022 

1.0 

.242 

2.6 

.014 

1.2 

.194 

2.8 

.008 

1.4 

.150 

3.0 

.004 


negative, thus producing a symmetrical pattern of dots as in 
Figure 12-2. Draw a smooth curve through the points you have 
plotted. If you find you cannot draw the curve smoothly and 
touch all the points marked, you may know you have estimated 
some values incorrectly and you should re-examine them. 

Characteristics of the Normal Distribution. Certain general 
characteristics of the normal distribution will now be stated and 
you should try to visualize them in relation to the figure you have 
just drawn. 


1 . The curve is symmetrical. If the figure were folded along 
its vertical axis, the two halves would coincide. 

2. The mean, median, and mode of the normal distribution are 
identical. 

3. The range is unlimited, infinite, in both directions, but as 
the distance from y increases the curve approaches the horizontal 
axis more and more closely. Theoretically, it never actually 
reaches the horizontal axis. 

4. The curve is smooth in contrast to the histograms or step 
curves we have previously drawn from observed data. 

5. The normal distribution has a standard deviation which we 
shall call <r. In the picture you have drawn, < 7 = 1 . In other 
words, the horizontal scale has been laid off in standard deviation 
units, so that the horizontal variable is actually in standardized 

form. Then the horizontal variable is- — • 


The Family of Normal Curves. We have said that the dis¬ 
tribution of heights of a population of adult persons of one sex 
closely approximates the normal distribution. If we consider men 
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and women a. s.pmf “‘J" 

s==? sap■ -* 

differ, and there may also be a difference between the two standa 

^WeTee in this pair of normal distributions an example of the 
fact that there ^ot a norm.. Us 

ir^elSiri hit mb. Of ca ri on win. it is 
based, are known. Severa m ® and J2 _ 5 The curves in Figure 
JS^eSrmetand <r,’but are based on different values 



F1b . 12-3 Noe.il Cwve, A„ H«ln 8 .He So.e M.on oed S.oedo.d De.o.lon 

but different Total Frequencies. 

For curve A, N = 300, a = 1 
For curve B, N = 200, a = 

For curve C, N = 100, a = 1 

ml o ;r 3 trimirp 12-4 all have the same N and m but 

of N. The curves 6 Figure 12-5 all have the 

differ in their values of <r. The curves in b 

same N ^ 

Parameters. i curves from other members of this 

member of the farm y o borrowed from mnthe- 

family are called parameters This one member of a 

matics where it is use gen * * b e same family. We shall 

family of curves from ot er ™ e h f information about 

find in the next two ^ Jroli .n t tiTJcal investigations. 

PaF siandard NormaTDistribution. When a normally distributed 

variable X is expressed in standard score form, * - = ? 

, J 0 and the standard deviation 

the mean of these standar b b COIls idered to be unity, 

is 1. The area, convenient to tabulate, to 

SSh". ....«^ ■“'* ” p “- 
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Fig. 12-4 Three Normal Curves AH Hav¬ 
ing the Same Mean and Total Frequency 
but Different Standard Deviations. 

For curve D, N = 300, a = 1 
For curve E, N = 300, a = 2 
For curve F, N = 300, a = 3 



Fig. 12-5 Three Normal Curves with 
Same Area but Different Values of fx 
and a. 



tion. Such a curve — with zero mean, unit standard. deviation, 
and unit area — is the form to which Tables I and II in the Ap¬ 
pendix relate. This form is called standard normal or unit normal. 
The equation for the general normal curve is 


( 12 . 2 ) 



(X - „)* 

N ST- 
<rV2v 6 


In standard form, this equation becomes 


(12.3) 




£» 

2 


e 


Formulas (12.2) and (12.3) have been presented for the in¬ 
formation of those readers who are interested. These formulas 
will not be used in any subsequent portions of this book. The 
normal distribution has been so thoroughly tabulated that few 
persons ever need to make use of its formula. 

Area Relationships under a Normal Curve. Ordinates are 
seldom needed for anything except for drawing the curve, and 
while there are some practical problems which call for knowledge 
of an ordinate you will not meet any such in this book. The area 
between two ordinates, on the other hand, is of great importance 
in a variety of problems, because such an area represents frequency. 
Before you can solve any problems of the kind presented in the 
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latter part of this chapter and in the two following chapters yon 

-SrHSrs ssSrass. 

ordinate at 5 - 0 and the ordinate at any other point, you may 

count the number of small squares in that area - est^Un g as 
well as you can the parts of squares cut by^the^cur ^ ^ 

divide this number by thei tota num ^ fairly we ll by 

area under the curve. The latter can in the tails is 

sheer counting but that is a ono . way . Draw a small 

hard to estimate. There is a height equal to the interval 

rectangle with length equa o a shaded rectangle in- 

from 0 to .1 on the vertical the ^ 

Figure 12-2). Count the nu^ “‘^ave the number of 
miiltinlv that number by anu 

small squares in the entire area^unde^oiKCur^ squares 

In the following examp es, ed will be compared with 

on the graph which you ave Table I are based on 

KirfS s 

r ^ •» ■“* ,k ' 

area in the specified segment by the total area. 

The column headed area in 
Table I gives the areas between 
an ordinate at a specified value 

x _ n 

of - and the ordinate at - - 

Such an area is shaded in the 

adjacent sketch. _ raD h find the percent of area 

By counting squares on > 2 * This should be approxi- 

between the ordinates at 0, a • value of .2 in Hie column 

mately 8. Now turn to Table I. find the va 

y Hntr pntrv in the column headed 

headed 5 and rea.l the correspondmg entry 

area. It is .0793 or approximately 8 percent^ ^ or(li . 

Again from your graph, me fay fillding the value 1.4 

..—.— 

0 



*/o 
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is given as .4192. As the curve is symmetrical, the area between 
ordinates at —1.4 and at 1.4 would be .4192 + .4192 = .8384 or .84. 

Find the area to the left of the ordinate at 1.8. The area be¬ 
tween ordinates at 0 and at 1.8 is .4641, and you know that half of 
the total area is to the left of the ordinate at 0. Therefore, the 
area to the left of the ordinate at 1.8 is .5000 -f .4641 = .9641, or 
.96. 

What area is to the left of an ordinate at -1.8? Reference to 
your graph will convince you that this is .5000 - .4641 = .0359, or 
.04. 

What area lies outside the ordinates at — 1.8'and at 1.8? This 
is the area to the left of -1.8 and the area to the right of 1.8, or 
.0359 -f .0359 = .0718 or .07. It can, of course, also be computed 
as 1.0000 - 2(.4641) = 1.0000 - .9282 = .0718. 


EXERCISE 12.1 

(Some students will need to go through only a few of the exercises in 
each group in order to achieve mastery. Others will need much practice. 
It is suggested that you check your results with the answer key and stop 
work in each group as soon as you find you are making no errors.) 

A. First, by counting squares on your graph and then by reading from 
Table I, find the area between an ordinate at the mean and an ordinate 

X 

at a point for which — has the value indicated. 

<7 

1. 00 3. 1.60 5. -2.00 7. 1.00 

2. .40 4. -.10 6. 1.80 8. .70 

B. What is the area between symmetrically placed ordinates at the in- 

X 

dicated values of -? Read answers from the table. 

O 

1. -.45 and .45 3. -1.00 and 1.00 5. -2.57 and 2.57 

2. -.67 and .67 4. -1.96 and 1.96 6. -1.03 and 1.03 

C. What is the area to the left of an ordinate at the indicated value of 

X- 

-? Read answers from the table. 

o 

1* -15 3. 1.90 5. -1.85 7. .67 

2. -1-32 4. 1.67 6. -1.65 8. -.67 

D. What is the area outside ordinates at the indicated values of z = -? 
Read answers from the table. 

_1. 1.96 and 1.96 3. -2.32 and 2.32 

2. -1.G4 and 1.64 4. -2.58 and 2.58 


5. —.75 and .75 

6. —2.81 and 2.81 
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Percentiles of the Normal Distribution. Suppose you have 
read from Table I that 73 percent of the area under the unit norma 

curve lies below - he., to the left of - an ordinate at a = - = 
, 61 . The same relationship may be stated in either of the following 

Wa5 (a) a = -61 is the 73rd percentile of a unit normal distribution 

(b) z.™ = -61 

chapter, the Letter * 

used to designate a percentile. A distribution, 

to denote a standardized deviate of: the"“rt norm^ & 

we shall use a with a PP™ p " a ® * he median (and also the mean), 
of that distribution. Thu so represents a point 

Whenever a has a subscript larger than £>0, it "P™?^ v P ues . 

to the right of the median an su represents a point 

Whenever a has a subscript smaller tlan .-ou, P ^ yalues 

to the left of the median a " d ®" ch P b ri t teUs us that 90 percent 
What is the value o r. The J^P and> there f 0 re, we 

of the area lies to the left = rcent lies between that 

know that 90 percent - 50 percen40 P , umn t0 find the 

point and 0. In Table I we look alonI»ea correspon ding 

entry which is closest to .40. Thisistnee. 

i j f — 1 28 This solution re- 
to - = 1.28. We conclude that z. 90 - 

a • Knt Anoendix Table II would have given 

quired some maneuvering but Appendix^ ^ 

us immediately the statement thg subscript is less than 

What is the value of .lo- . sj nC e 10 percent of the 

.50 we know the value must e " * n ^ w and t he mean must be 

area is to the left of a,, the ^^^“e find"the number nearest 

■fo In'the = arc 0 a columtlnd read the corresponding value in the 

£ column. As before, it is 1.28, but now we must attach a minus 

£ Thus z.io = - I-2S. Now look in Table II. Here you read 

that a.,o= - l- 282 - , 1.282, the following relations 

Since z.io = - l- 282 ana Z M 
hold: 


2.10 

Z.90 

Z.10 + z.w 


— Z. 90 

— Z.10 
0 


It 
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These statements might be generalized by using some letter, say, 
a, to represent one of the subscripts: 


Z a = — Z\-a 
Z\—a = ~ Z a 
Z a + Z\-a = 0 

If a normal distribution has /x = 72 and a = 13, what is its 40th 
percentile? From Table II we read that the 40th percentile of 

the standard normal curve is -.253. This tells us that ———- 
= -.253. 47 

If /x = 72 and a = 13, then ^^ ^ = - .253 


Therefore, X. i0 - 72 = - .253(13) = - 3.289 

X.«> = 72 - 3.289 
X AQ = 68.7 


EXERtlSE 12.2 

In each group answer only as many questions as you need to achieve 
mastery. Be sure to go on to Group I as the questions in that group pro¬ 
vide important background for Chapter 13. 

A. What area is to the left of each value? 

1* 2 07 2. 2.96 3. 2.Si 4. 2.J1 5. 2.41 

B. What area is to the right of each value? 

1 • 2 99 2 . 2.05 3 . 2.40 4 . 2.995 5 . 2.96 


C. Which of the values named in groups A and B are negative? 

D. What area lies between the stated values? 


1. 2.oi and 2.99 

2. 2.oos and 2.995 

3 . 2.25 and 2. 7S 

4 . 2.02 and 2.0a 

5 . 2.015 and 2.975 

6 . 2.60 and 2.71 

E. By reference to Table II verify the following statements: 

1 . 2 . 0l = - 2 . 32 G 

2. 2.25 = — .0745 

3 . 2 ., 2 = - 1.175 

4. 2.70 = .524 

5 . 2.75 — .G /45 

6. z .80 = .842 

7 . 2.20 = — .842 

8. 2.94 = 1.555 

9. 2.999 — 3.090 

10 . 2.005 = - 2.576 
11 - 2.99s = 2.576 

12. 2.975 = 1.960 

F. What area lies outside the stated values? (This area is the sum of the 

area to the left of the smaller value and the area to the right of the 
larger.) 

1 . 2.01 and 2.99 

2. 2.005 and 2.595 

3 . 2.025 and 2.975 

4 . 2.02 and 2.9a 

5 . 2.05 and 2.95 
G. 2.10 and 2.90 
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PROBABLE ERROR 

O Between which percentile values is the middle 30 percent of the area 
included? Solution: If 30 percent of the area is in a strip symmetnca.ly 

placed around the mean, one 
half of 30 percent = 15 per¬ 
cent must lie on each side 
of the mean. Therefore, 
the area below the lower 

edge of that strip must be _ ^ . 

1^-35 and below the upper edge, .o 0 + .lo - . 00 . int v 
.50 — .15 — .oo ana oeio w . 3 g5. These values were 

' therefore, extends from z.a* - ' - 385 ^°^ „ and int . rp0 , a ting be- 

obtained by reading and z.» from lame 

tween them. 

H. State the percentile values which would bound a ^Kldcsu.pof ar 
symmetrically placed around the mean if that area is as folio,. ^ 

1. .99 3. .50 '?? g. .999 

2. .98 4. .95 6 - 80 

I. In a normal distribution with p = 64 and <r = 8 , 

1 . What is the value of the 10th percentile? 
o What is the value of the 70th percentile. 

3 . What proportion of cases would have scores larger than 70? 

4. What proportion of caseS ,7^ mid^sT^rcent of the cases lie? 

5. Between what scores wou “ the „f the cases lie? 

6. Between what scores would the middle P 

7. Beyond what score would the hig e>‘ P" gard t0 the mean, 

8. Outside what two scores, symmetrically placed reg 

would the most extreme 1 percent lie. 

J. Answer the eight questions asked in Group I if P = 91 and a - 15. 

K. Answer the eight questions asked in Group I if P = j! > and a^ 

L. Answer the eight questions asked in Group I if p = 6 and <r - 4. 

is called the probable error. T m ^ ^ ^ since 1815 and 
it is somewhat misleading but 

‘ The term was introduced in 181 V'th^p^T^ pohTsUr, published in the 

Friedrich Wilhelm Itessel in a P'J|T Sllldies in th . History of Statistics, p. 24). 

Berlin Agronomical ^ earl>ook. • Almont at once the expression appeare d 

He used the term der xvahncheinliche ' vro babali* in Latin, as Verrcur probable 

in scientific journals all overEuropc. P pro bable error in English. '1 he roots 

in French, as I'errore profile m Italian, as me ? 

of this term arc deep and teuacious. 
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seems to survive all efforts to do away with it. Neither the term 
nor the idea will be used again in this book. It is mentioned here 
only because you are likely to meet it in your reading elsewhere. 

Normal Probability. In certain very interesting and very im¬ 
portant types of problems, some of which are treated more fully in 
Chapters 13 through 16, the task is to assess the likelihood that a 
single individual, chosen “at random” from a group of individ¬ 
uals, will come from a particular category of that group. In such 
problems the proportional frequency in each category is called the 
probability of that category and the proportional frequency distribution 
is called a probability distribution. 

There are many forms of probability distribution; not by any 
means are all probability distributions normal. A probability dis¬ 
tribution may have a continuous scale or may consist of discrete 
classes. It may consist of just two classes. For example, it is 
well known that about 51 percent of newborn babies are boys and 
about 49 percent girls. The two classes, boys and girls, and the 
relative frequencies .51 and .49 constitute a probability distribu¬ 
tion. One may say that there is probability .51 that the first baby 
born in Chicago in the year 1975 will be a boy. In general, a prob¬ 
ability distribution consists of a set of classes and the relative 
frequencies related to them. Many extremely useful probability 
distributions are described in more advanced texts. At this point 
our emphasis will not be on the topic of probability in general, 
but on the use of the normal distribution as a probability distribu¬ 
tion in preparation for a better understanding of the important 
ideas taken up in the next four chapters. 

First let us consider a rather artificial situation. Suppose that 
for each of 50,000 school children an intelligence quotient is avail¬ 
able and that the distribution of these is approximately normal. 
If each child’s IQ is recorded on a small ticket, all the tickets 
are thoroughly mixed in a huge lottery, and one ticket is drawn at 
random, 2 what is the probability that the IQ recorded on that 
ticket will be at least two standard deviations above the mean 
of the entire group of 50,000? Because the distribution has been 
described as normal, this question can be answered by merely 
finding the area under a normal curve above an ordinate at z = 

Y_ 

= 2. Table I shows this area to be .023. What is the prob¬ 
ability that a ticket drawn at random will contain an IQ very 


1 .Sonin discussion of what drawing “at random” moans will ho found in the next 
chapter. For the present the term will he left admittedly vague. 
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close to the mean - say, one lying within 0.1 of a standard de¬ 
viation from the mean? Answer 2(.0398) - .08. 

Now let us discard as irrelevant the fiction of writing the IQ 
on tickets and physically drawing one by means o ,i lotterj bu 
retain the idea of a very large normally distributed populat on ot 
IQ’s Se ect ng at random an individual from such a population 

of',— diK.ul*y P«»P" sc 

intelligence levels do not tend to frequent the same places- 
us again place a question with respect to an unborn mfant. ^ ha 
is the probability that the first child born in Chicago in Uie^yea^ 
1975 will have an intelligence quotient at least 3 s 

„on. b*. lb. gen- ~.«J 

under the normal curve below z - - d. la 

■ 001 ln most physical populations it is difficult either to ^establish 
the normality of distribution or to select individuals at randon 
both. The principal use of the norma curve aaa Probability d 
tribution is in connection with sampling problems as desert 

" “LStitT Group. .1® *!»■' *»«•- 3 ™"” * """1 

job satisfaction (or interest in teaching, or con ^ . 

or any other attitude difficult to measum objectively) ^ ^ 

search worker has drawn up a lis ^satisfaction and 

ranging from extreme 1 ^-nts looks 

wants to place these on a scale. Ranking difficult 

like a formidable undertaking but it does no 

to place them in ordered categories. t iwsunDose 

The number of categories is an arbitrary c ioice. L ^ ^PP ^ 
the investigator decides to use 9 categories. 1 . represents a 

makes the assumption that his set o s a wishes to have 

normal distribution of job satisfaction am u ■ 

each of the 9 categories cover the same 

distribution. Strictly speaking this is would 

of a normal distribution is infinite and of 

also be infinite. However, if he lets eat » < a . from 

0.5a, the 9 categories will cover a range of 4.5a, or 

—2.25<r to 2.25<r. . Son shows that 

Reference to the table of the P curve lies 

only a little more than 1 percent invest ieator 

above 2.25<r or below -2.25a. Consequentlj t..c mn. 

may decide to make the end intervals open, so that all 
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1.75a - constitute one category and all cases below —1.75 constitute 
another category. The following tabulation shows the 9 categories 
formed by such a subdivision of the scale, the corresponding per¬ 
cents of area under the curve, the number of statements in each 
category in accordance with the indicated percents, and code 
numbers ranging from 1 to 9 which are to be the scale values of 
the categories. 


Category 

Percent 

Number 

Code 

Above 

1.75 

4.01 or 4 

6 

9 

1.25 to 

1.75 

6.55 or 7 

10 

8 

.75 to 

1.25 

12.10 or 12 

18 

7 

.25 to 

.75 

17.47 or 17 

26 

6 

-.25 to 

.25 

19.74 or 20 

30 

5 

-.75 to 

-.25 

17.47 or 17 

26 

4 

— 1.25 to 

—.75 

12.10 or 12 

18 

3 

-1.75 to 

-1.25 

6.55 or 7 

10 

2 

Below 

-1.75 

4.01 or 4 

6 

1 



100.00 

150 




The number of items to be placed in each of the nine categories 
is shown in the accompanying sketch as an area under the normal 
curve between ordinates drawn at the points indicated. 

The statements must now be distributed in accordance with 
the frequencies shown in the tabulation. Each statement is writ¬ 
ten on a separate slip of paper and these slips are sorted into 9 

piles of the size specified. The 
6 statements (no more and no 
less) which are judged as ex¬ 
pressing the greatest degree 
10 1^^- of job satisfaction (or interest 
- 7 A -»a- 3 a -v* 'A 3 A 3 A 7 A in teaching, or whatever vari¬ 

able is under consideration) are placed at the extreme right and 
coded 9, while the 6 which are judged to express the greatest dis¬ 
satisfaction are placed at the extreme left and coded 1. The other 
slips are assigned to intermediate piles in such a way as to form 
a set of ordered classes. The code numbers from 1 to 9 which are 
assigned to these classes are usually treated as constituting a scale. 

This is similar to the line of argument on which is based the 
slanine scale (Standard nine) employed by the United States Air 
Force in its testing and classification program. 

This is also the general line of procedure for what is called the 
Q-sort used in many psychological studies. Each of the items 
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to be sorted is written on a separate card, and the judges work 
independently. If they are told how many cards to place m each 
pile P every judge will achieve the same mean and standard devia¬ 
tion If they are left free to decide how many cards to place in 
each pile there may be considerable discrepancy from judge to 
udge as io mean and as to standard deviation of t e d.stnbution 
of cards The purpose of having more than one judge to do the 
sorting is twofold: (a) to ascertain how well different judges can 
agree on the placement of items and (b) to improve measurement 

by eliminating 

isrc r atf r i rats ■» 

items, namely 2, 3, 6, 9, 10, 9, 6, 3, 

EXERCISE 12.3 

verify enough of the following statements to be sure you understand 
frequencies^ each category will ’be .0058, .2417, .3830, .2417, and .0008, 

2 . H indbddualsarc^cl^sifi^d’in^categories each with jan^ £ 

-SXS2Z &£2 

.02, .14, .34, .34, .14, and h2. the division points will be 

4 . If 11 categories are used with r g j , 8 , and the relative 

.u .t«. ° 78 ' 

,. ,n;«•” «S 

..“" i ' oi ' “■ *■ ■* 

.20, .17, .12, .07, .03, and .01. 

Transforming O^eredCategories into Scaled Scorers. ^ Lrtus 
schooMor'scholarships, with the stipulation that the recipients 
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shall be chosen on the basis of average class marks. This presents 
a problem because the school’s records are in the form of letter 
grades and now it must average these letter grades in some way 
which will appear fair and objective to all concerned. One member 
of the selection committee proposes to use the arbitrary code: 
A = 10, B = 9, C = 8, D = 7, E = 6, F = 5. Another member 
points out that reducing each code number by 5 so that A = 5, 
B = 4, C = 3, D = 2, E = 1, and F = 0 would simply reduce each 
pupil’s average by 5, thus leaving pupils in the same relative order, 
but would somewhat lessen the work of obtaining averages. The 
committee is about to adopt this latter code when someone calls 
attention to the marks given over a period of several terms by 
three teachers in the same department, teaching students who are 
presumably similar in ability. These three teachers, Smith, Jones, 
and Brown, have assigned marks as indicated in Table 12.2. One 
member of the committee objects to the code on the ground that 
the letters do not seem to have exactly the same meaning as used 
by these three teachers, that an A given by Brown does not present 
as strong evidence of high achievement as one given by Smith. 
Another objection he makes to the code is that it is based on the 
assumption that the 6 letters represent 6 evenly spaced points on 

TABLE 12.2 The Number and Percent of Students to Whom Each of 
Three Teachers Has Assigned the Specified Letter Mark 


Number of marks given by Percent of marks given by 


Mark 

Smith 

Jones 

Brown 

Smith 

Jones 

Brown 

A 

24 

36 

106 

9.6% 

12 .0% 

26.5% 

B 

130 

78 

152 

52.0 

26.0 

38.0 

C 

68 

95 

96 

27.2 

31.7 

24.0 

D 

12 

55 

36 

4.8 

18.3 

9.0 

E 

8 

12 

6 

3.2 

4.0 

1.5 

F 

8 

24 

4 

3.2 

8.0 

1.0 

Total 

250 

300 

400 

100 .0% 

100 .0% 

100 .0% 

a scale, 

that A 

is assumed to be as 

much better than B 

as B is 

better than C, 

etc. This he says is 

unrealistic. 

This committee 

member 

proposes an 

alternate plan of scaling, 

based on 

the as- 


sumption that there was really a normal distribution of perform¬ 
ance for the students taught by each teacher. He admits that this 
assumption may not be true — that it can neither be established 
nor refuted because there is no evidence. He thinks the assumption 
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of normality is at least as acceptable as the assumption that incre 
ments between consecutive letters represent equal scale intervals, 
fn orderTplace the categorical ratings (the letter grades) on a 
scale it is necessary either to make an assumption concerning the 
scale’interval (i.e., assign an arbitrary code), or to, make an. assump¬ 
tion concerning the form of distribution and to derive the 

equivalents from that distribution. ki t he 

The committee member who argued in favor of making t 

assumption of normality carried out the appropriate computation 
and presented the following transformation code for the 
Smith, Jones, and Brown: 

Mark ' ** Smith Jones 


A 

B 

C 

D 

E 

F 


67 

54 

43 

36 

33 

29 


66 

57 

49 

42 

37 

32 


Brown 

61 

51 

43 

35 

29 

24 


He admitted, h.,„,e”d m‘.S 

SSOSS& s& * srs 

—r i ri:— 1 1 

“ xzszsfe** ns star x 

sa - ,h ' 

percents of frequency shown 
in Table 12.2. For each seg¬ 
ment of area there must be 
found a value of the normal 
deviate z which will serve as 
typical value for that cate¬ 
gory. Either the median value ^ bg use( j > but the median 

of the category or the mea Consider category A. 

is easier to obtain and so we .hall use it.. Consnlcr J ^ 

The median of that category is P lerefore , has .00 of the 

area of the segment on either « de and there + 04 + 

formed value for grade A will be 15.55 + oO 60. 



196 


THE NORMAL DISTRIBUTION 


Now consider the median of category B. The area below it ''dll 
be .08 + .04 + .183 + .317 + .13 = .75 and the area above it .13 + 
.12 = .25. The value of z. 7 & is .6745 and T = 50 + 6.7 = 57. 


TABLE 12.3 Computations Used in Transforming a Set of Ordered 
Categories to Scaled Scores (T) on the Assumption 

of Normality of Distribution 


Category 

Frequency 

Percent 

in 

category 

Percent 
below 
median of 
category 

z 

T 

= 50+102 

A 

36 

12 .0% 

94.0% 

1.56 

66 

B 

78 

26.0 

75.0 

.67 

57 

C 

95 

31.67 

46.17 

-.10 

49 

D 

55 

18.33 

21.17 

-.80 

42 

E 

12 

4.0 

10.0 

-1.288 

37 

F 

24 

8.0 

4.0 

-1.75 

32 


300 

100 .0% 





The computations leading to the transformation are presented 
in Table 12.3. Here the first two columns provide the data. The 
third column is obtained by dividing each frequency by the sum 
of all the frequencies (300 in this case). The percent below the 
median of the category for category C is 8.0 + 4.0 + 18.33 + 
31 67 

-k- - = 46.17, and similarly for the other categories. The z values 


are read from either Table I or II and need not be recorded with 
great precision because after they are multiplied by 10 they will be 
rounded to the nearest integer. The entries in the final column 
are obtained by multiplying z by 10 and adding 50. 

Such scores were named T scores by William A. McCall in 
honor of E. L. Thorndike and Louis Terman, pioneers in the meas¬ 
urement movement. Each T unit is one tenth of a standard de¬ 
viation. The mean T is always 50 except as it may be affected by 
rounding errors. These transformed T scores must not be confused 

X - X 

with the transformed standard scores, Z = 50 4- 10 --— used in 


Chapter 7. There are three important distinctions. In the first 
place, Z scores cannot be obtained from ordered categories but 
only from .scaled scores. In the second place, in order to obtain Z 
scores, one computes a mean and a standard deviation and then 
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2 = 50 + 10 (X ~ ; whereas to obtain T scores one computes 

percentile ranks (the percent below the midpoint of the cat ^ or >'|> 
reads the corresponding values of z from a table of normal prob¬ 
ability and then T = 50 + 10a. Only the final step of the o 

procedures is the same. In the third place T scores have a norma, 

distribution regardless of the form of the original distribution, 
5 scores do not have a normal distribution unless the original van- 
able had a normal distribution. 


20 

30 

04 

48 

30 

8 


EXERCISE 12.4 

1. Verify the T score values given for the grades of Smith and Brown on 

2 . Supposed supervisor has placed a group of 200 teachers in 6 qualitative 

categories as follows: 

Excellent in every way 
Superior in most traits 
Above average 

Fair .. . . 

Poor but not subject to dismissal 
Very poor subject to dismissal 

What transformed scores correspond to these classes? 

Profile Chart When an individual is to be evaluated relative 
to a group on several traits at once, it is customary to plot Ins 

s T^.r i 

5 & r £ 

referred to the standard normal ^ but.on. 

at the 50th percentile rank and extends to Jhe-ale^ ^ 

sentmg the individual s P cre , , , perce ntile rank 26 

we see from the bar* ‘hat Pupcomputation, and so on. 

" \ Q ’ "hbliofto i s value "showing the relationship of an in- 
In addition to va ^ usefu , in companng ,n- 

sssxsxx *r s 
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Cumulative Percent 

Fig. 12-6 Profile on Normal Probability Scale for Pupil No. 1 of Table IX. 
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transforming ranks to normal deviates 

IQ and the highest scores on the verbal tests of reading speed and 
language, but is lowest in computation and history-civics. 

A profile chart like that shown in Figure 12-6 can be made 
most simply by plotting percentile ranks on specially printed 
paper which can be purchased as “arithmetic probability paper. 

If such paper is not available, the percentile ranks can be converted 

a line grap \ rather than connecting lines 

Usting — —f a trend, and simplifies both 

S”.“ r'£‘f sssr t 

;£325 - 

tribution than at the middle. profile chart by plotting 

Another common practice isto m J profile ^ ^ 

the standard scores described in Chapter from trait 

practice is limited when the shape M -t on mfias . 

to trait. Consider, for example.two ^ Qn U)C 

ured by an easy test an ^ q one and a half standard dc- 

whereas on 

standard scoreof onc^and onTlmhmeans something q uite different 

on the two tests. n^viates. The use of ranks 

Transforming Ranks to “^"iibe.l in Chapter 10. 
in computing a correlation cot . ^ ^ Ulc underlying variable 

It often seems reasonableit . normal|y distribuled. Appendix 
represented by the ran ' f t () f ranks into a set of scores 

Table X may be used to transform a .ct for samples 

from a normal distribution ... which g - •’« 
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in which 5 ^ N ^ 30. To find the normal equivalent of a rank 
R when N > 30, first find the proportion 

(12.4) p = 1-^— 

then read in Table II the normal deviate z p in a unit normal curve 
corresponding to the proportion p. The corresponding normal 
deviate when p = 50 and a = 10 is 

Z p = 10 z p + 50 

Discovery of the Normal Curve. 3 The curve which we now 
call “normal” is the curve of normal probability or curve of error 
which attracted the interest of most of the great mathematicians 
and astronomers of the first half of the nineteenth century. More 
than two centuries ago, Abraham De Moivre (1667-1754), a ref¬ 
ugee mathematician living in London, making his living partly 
by solving problems for wealthy gamblers, recognized that the 
random variation in the number of heads appearing on throws of 
n coins corresponds to the terms of the binomial expansion of 
(.5 + .5)", and that as n becomes larger this distribution approaches 
a definite form. In 1733 De Moivre derived the equation for this 
curve and presented it privately to some friends. To him it was 
only a mathematical exercise, utterly unconnected with any sort 
of application to empirical data. 

At that time there were no collections of empirical data at hand 
for study. As yet no one had made any measurements of any 
large number of individuals. A Swiss mathematician named 
Jacques (or James) Bernoulli (1654-1705, eldest of the very re¬ 
markable family of mathematicians by that name) had suggested 
that the theory of probability might have useful applications in 
economic and moral affairs, but he, himself, was too near death 
to investigate the applications, and, moreover, he had no numerical 
data which he could have used for that purpose. The idea must 
have seemed fantastic to his contemporaries. In 1713, his great 
book on probability, Ars Conjectandi , 4 was published posthu¬ 
mously under the editorship of his nephew Nicolas, who seems to 
have tried in vain to induce other mathematicians to develop the 

3 For a more extended treatment, see H. M. Walker, Studies in the Hislon/ of Sta¬ 
tistical Method, Baltimore, The Williams* Wilkins Co., 1929, Chapter II; “Bi-centenary 
of the normal curve,” Journal of the American Statistical Association, 29 (1934), 72-75; 

‘Abraham de Moivre,” Scripta Mathcmatica, 2 (1934), 316-333. 

4 Conjeeto = to throw together, therefore, to gamble, therefore, to guess at or 
sunnifie. 
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discovery of the normal curve 

argument further. Nearly a century passed be [? re a ^ToTtudy 
worker began to gather large masses of concrete data and to studv 

the properties of distributions. The application of the norma 

eurve °n P studies of concrete data begins with the work o he great 

mathematical astronom s who ^ Gauss 

5SWS whom U--SXS5; 

pendently and presumably without any knowledge of De 
"probability curve is often called 

cause until recently it was suppose However in 1924 , Karl 

De Moivre. 6 , describe data other 

than errors of observation in 

originated with the great e . gia , . , at statistical method 

(1796-1874), who first popuianzed theidea that srat ^ ^ ^ 

was a fundamental discipline adaptable' *^ « w ° as Evinced 

terest in which mass fwaited only for 

that the measurement of menta , and was surc 

the collection of sufficient an r the distribution of these 

-» - - - r; - 

. Pearson, K., “ Historical not* on the origin of the normal curve of error, Bio- 

melrika, 16 (1924), 402-404. 
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Introduction to Statistical 
Inference 


In an investigation which deals with matters of general interest, 
the findings are usually applied to a much larger number of cases 
than were actually observed in the sample of cases. This use of 
samples in statistics is probably the most important and certainly 
the most intriguing aspect of this science. The idea that informa¬ 
tion obtained from the examination of a relatively small number 
of cases can be used to throw light on the characteristics of a vast 
and unexamined universe is an exciting idea. That the sample 
not only furnishes an estimate of some characteristic of an un¬ 
known population, but also furnishes a measure of how much 
faith can be placed in that estimate, is still more remarkable. 

Situations in Which Samples Are Employed. Statistical sam¬ 
pling plays a part in almost all statistical studies on which de¬ 
cisions for future action are to be based. Sometimes sampling is 
necessary because the universe, or population, about which infor¬ 
mation is desired is infinite and could never be examined in its 
entirety. This situation is typical of most psychological experi¬ 
ments. Some populations, even though finite, are so vast or so 
inaccessible that it would not be feasible to examine all their 
members. Sometimes, as in Example 1 below, it is important to 
process the results more quickly than would be possible if a com¬ 
plete count (or census) of the entire population were made. In 
certain kinds of research, measurement means the destruction 
of the individual measured. This is the typical situation in 
industrial studies where the strength of material or the length of 
life of some product is measured. To measure the tensile strength 
of cloth, pieces of cloth must be destroyed; to measure the break¬ 
ing load of steel, steel must be broken; to measure the effective 
life of a battery or a light bulb or an automobile tire, it must be 
used until it is worthless. Consequently, in much industrial 
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research the idea of studying an entire population is fantastic, 
and even the use of large samples is prohibitive. 

Population. A sampling study is characterized by the fact that 
its objective is not simply to describe those individuals observed in 
the sample, but rather to search for information about the popula¬ 
tion from which the sample was drawn. 

The statistical concept of a population , also called a universe, 
is an abstract concept fundamental to statistical inference. “It 
refers to the totality of numbers that would result from indefi¬ 
nitely many repetitions of the same process of selecting objects, 
measuring or classifying them, and recording the results. A 
population is, thus, a fixed body of numbers about which we would 
like to know. What we actually know is the numbers of a sample, 
a group selected from the population.” 1 The reader must not 
think of a population in this sense as necessarily consisting of 
people or even of measurements of people. The three examples 
which follow happen all to be situations in which the population 
consists of observations on persons; they were chosen not because 
of but in spite of that peculiarity. A population might consist of 
observations on schoolhouses, on books, on cities, on shoes, on 
carloads of grain, on manufactured items, and so on. 

To appreciate the nature of such studies, let us examine a few 

examples more closely. 

1. Estimating the number of unemployed in the United States. 
In order to obtain current information on the state of the labor 
market, the United States Bureau of the Census conducts periodic 
sampling studies from which it calculates an estimate of the num¬ 
ber of persons unemployed in various kinds of work and in dif¬ 
ferent parts of the country. For the purpose of this investigation, 
each person in the labor market is placed in one of two classes: 
employed and unemployed. A two-class population i.s ca le a 
dichotomous population. The sample is necessarily obtained by 
selecting individual persons from the total number in the labor 
market. For the purpose of this investigation, when John Jones 
is drawn into the sample, we are not interested in all the persona 
attributes of John Jones as a human being, but only in w iet ier 
he belongs in the class of the employed ( E ) or the unemp o>e ( ). 
Thus the population is the totality of all observations U and t 
and the sample consists of the number of observations o aiu 
of E among the individuals selected. Ihis particular examp e 

1 W. Allen Wallis and Harry V. Roberts, Statistics, a New Approach, The P ree 1 ress, 
Glencoe, Ill., 1956, page 126. 
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was chosen because it illustrates a dichotomous population and 
because it illustrates a situation in which the processing of a sample 
can be done rapidly enough to provide information while it is 
fresh and useful. The processing of a complete census would 
require so long that, by the time the findings could be published, 
they might no longer be applicable. 

2. Standardizing a test. Standardized tests of achievement, of 
ability, of attitudes, or of preferences, etc., are used to determine 
the status of an individual person relative to a population, or of a 
subgroup relative to a population. (Such tests are also used to 
compare subgroups, but for this purpose it is not usually necessary 
that they be standardized.) 

Consider for example the use of the Stanford Binet Test to 
provide information concerning the intelligence of an 8-year-old 
child. The child's test score would be useless in this connection 
without information about the distribution of the Binet scores of 
8 -year-old children in general to provide a standard against which 
an individual score could be rated. But the administration of the 
Binet test to all 8-year-old children is manifestly impossible, 
partly because of expense and partly because the class of persons 
who are ultimately to be measured by this test extends on into 
the future and includes children who are not now members of the 
population of 8-year olds. The desired norms can be developed 
only on the basis of sample data. 

In this example the population of Binet scores for 8-year olds 
may be presumed to have a normal distribution, whereas in the 
first example the population was composed of two classes. While 
in this example it is impossible to obtain a complete census, in 
the first example it was merely inexpedient. 

3. Testing the effectiveness of a new drug. When a new drug or 
vaccine is developed in a laboratory, its effectiveness must be 
tested by using it on live subjects. The first tests may be made on 
animal subjects, but, if it is intended for human use, later tests 
of its effectiveness must be made on human subjects. 

A naive research worker might assume that all he needs to find 
to prove the effectiveness of a new drug is the proportion of 
subjects who recover from the maiady in question when they are 
treated with the given drug. Thus he would be thinking in terms 
of a two-class population of observations; each person who had 
had the malady and been treated by the drug would be labeled 
either “recovered” or “did not recover.” The proportion who 
recover under the given treatment would indeed provide some 
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measure of the effectiveness of the drug but would not provide a 
basis for deciding whether to advocate its use. That decision 
can be made only after the results are compared with some 
alternative treatment, which may be some other drug or no drug 

at The research worker must, therefore, consider two populations: 
(1) observations on persons who have the malady and are treated 
by drug A and (2) observations on persons who have the maladj 
and^receive treatment B. Each of these populations is composed 
of two classes, those cured and those not cured. Presumably the 
decision to use drug A would be made only if the first population 

has a larger proportion of cures than the second. 
h To sum up, we speak of a population as the collection of all 

possible measures of a certain kind, and we associate with that 

population a frequency distribution. For c0 "'' enle “^ b ut'this 
viduals themselves are ^ %%%££? on 

th“» f ion of individuals is re 

ing to the state (or District. sps . when they 

age ^^present enrollment or endowment or rate 

nf tuition there are populations of quite different natures 

^Randomness It should be fairly clear at this stage of the dis- 

JESS n," d '„ uC “ c 

ill*. »...»»«»>'• - 

selected is a very important conslderat, ^ e f , es is based on 
All statistical theory .nvoWing Uie use of samp ^ ^ ^ 

the notion of randomness. J 0 y t ) lc mm c 

MUt.... i. ... Z,'™ permits 

t. .!» •!..« «< ««'“■ ™* 

will be discussed more full y be '° W e , ; on o{ su bjects is often one 
Achieving randomness in the 1 investigation. Such 

of the most difficult aspects of a statistical 
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investigations have probably been criticized more often for failing 
to achieve randomness than for any other shortcoming. The 
well-known Kinsey studies of sexual behavior are an example of 
studies which have been criticized severely in this respect. The 
problem of achieving randomness is solved differently m each 
field of endeavor and becomes an element of study in that field. 
However, some indications of methods for achieving randomness 

will be pointed out here. 

Suppose, for example, it is proposed to set up an experiment to 
study three methods of teaching statistics. For the purpose of the 
study, suppose the 98 students listed in Table VIII are to be 
divided into three groups, each of which is to be taught by one 
of these methods. It becomes, obviously, a matter of consid¬ 
erable importance to make sure that the groups to be taught by 
different procedures do not basically differ in other ways which 
will influence their learning. Assigning students to the different 
groups in a random manner is the best way to eliminate this 

difficulty. 

To make such random assignment, slips of paper may be num¬ 
bered 1 through 98, thoroughly shuffled, and then divided into 
three groups, just as playing cards might be dealt to three players. 

A less cumbersome way to achieve the same result is the use 
of a table of random numbers like the one in Table XI of the 
appendix. This table consists of digits which have been previously 
randomized, so that their use has the same effect as the shuffling 
just described. To facilitate keeping track of one’s position while 
using the table, the subdivisions of the numbers in Appendix 
Table XI are presented in blocks of five rows and five columns. 
Random numbers of any size can be obtained from this table. 

For making a selection among the 98 students, we find their 
code numbers by choice of consecutive pairs of digits beginning 
with any row or any column. If we use the first 20 digits in line 1, 
we find the following pairs of digits: 10, 48, 01, 50, 11, 53, 60, 20. 
This selection gives us 8 students after eliminating duplicates. 
If we continue in this way until 33 students have been selected, 
the first list of randomly selected students is available. Should 
the number 99 occur, it would be passed over because no student 
has that code number. The table may be entered at any point 
and read in any direction — to right, to left, up the column, down 
the column, or diagonally. Use of a different starting point 
and reading plan will produce a different but equally good random 
selection. Had there been, for example, 4200 students in the 
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population to be sampled, it would have been necessary to read 
four-digit instead of two-digit numbers and to discard all such 

larger than 4200. 

EXERCISE 13.1 

1 Suppose you want to choose 10 students at random from a group of 
260 You decide to begin in the upper left-hand corner of Table XI, 
reading downward. Verify that you would select the code numbers 

104, 223, 241, 94, 103, 71, 23, 10, 70, and 24. , 

2 Suppose you had decided to begin in row 31 and block 6 and to rea 
downward Verify that you would have selected code numbers 183, 

170, 257, 244 .^219, t0 divide rando mly into 

3 ' two P g°ro e ups of 9. You decide to begin at the extreme left of row 40 

a 

Cide ^^“o^would 0 be mlo"tt re n adin P g°from the left 
aXLking UP numbers 4, 2, and 9. These 9 numbers then form one 
group and the remaining 9 the other, 

Group !: Students 14, 12 7, 18 1 31.4 2 and 9 

Group II: Students 5, 6, 8, 10, 11, 13, 15, 10, 

Sample The group of subjects selected from a population for 

....r ~ - jzsixss. 

involving randomnes; . selec t e d P from it by some random 

•<*■>* “ p " u 

called a simple random sample. subdivide a population 

Sometimes it or geographic 

into several subpopulations, say subpopulations. 

groups, and then to sample selected from 

The subpopulations are called strain, a other variations on 

i. * ">f'‘ S& 

random sampling are descri describing the 

relationship of a population to a from 

What is wanted, of course. , » » J^ ” f pract ic„ble way of 

a population of measures, bu _ sample of subjects from the 

obtaining such measures is to population of subjects, and 

collection of subjects loosely called a population 
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then to measure the subjects in this sample. If the subjects have 
been selected at random, the measures of those subjects will 
constitute a random sample from the population of measures. 

Parameter and Statistic. A statistical population always has a 
definite distribution but ordinarily the characteristics of that 
distribution are unknown. If the observations are on a scaled 
trait, the distribution has a mean, a median, percentiles, a standard 
deviation, etc. If the distribution is not scaled, it is characterized 
by the proportion of individuals in its various classes. The char¬ 
acteristics of a population are called parameters; the corresponding 
characteristics of a sample are called statistics. 

The symbols used in this text for the parameters which we shall 
discuss and the related statistics are as follows: 


Characteristic 

Population 

parameter 

Mean 

M (mu) 

Standard deviation 

<j (sigma) 

Variance 

<r 2 

Correlation coefficient 

P (rho) 

Proportions in the two classes of 
a dichotomous distribution 

P,Q 


Sample 

statistic 

X 
8 
8 1 
r 


v , q 


Statistical Inference. Ordinarily, the only information avail¬ 
able to an investigator is contained in the statistics which he ob¬ 
tains from a sample. From this information he makes inferences 
about the population parameters. 

Statistical inferences are usually of two kinds. One is estima¬ 
tion of parameters of a single population. The other is comparison 
of populations. Simple examples of both kinds of inference will 
be described below. 

Law of Large Numbers. The basic justification for statistical 
inference is that the distribution which is obtained from a random 
sample tends to resemble the distribution of the population from 
which it was drawn. This tendency increases as the size of sample 
increases. Consequently, certain statistics computed from the 
sample tend toward corresponding values of parameters of the 
parent population. Thus the mean of a large random sample 
tends to be close to the mean of its parent population. This 
tendency of distributions of random samples to resemble the dis¬ 
tributions of their parent population more closely as sample size 
increases is called the law of large numbers. 

The importance of randomness in the operation of the law of 
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large numbers can hardly be exaggerated. Mere size without 
randomness may not provide the necessary approx.maUon of 

Sa % P lotoevlSSity. Two extreme views, both incorrect, are 
often expressed by statistically u^ined person^ m respect^o 
inferences from samples to populatio . 

»„p k „ » ex.ct reproduction ^ 32 

of statistics compute yiew is t0 hold that information 

values of parameters. The inference about 

from samples is quite valueless as jtofasample 

population values because ®° “ ig the purpose of the discussion 

one happens to have cho * , .... credibility, of informa- 

which follows to evaluate the reliability, or creOiDinry, 

tion obtained from samples artificial 

As a first step in this discussion ; J known d is- 

situation in which we have a ghall draW random samples 

2 JTo ‘.».E computed i,«m the. 

we have chosen a set of scores on haye distr ibution which 

Test of 447 college students. 1 he sc ■ and = 37 .15. 

is very nearly normal with P* ra ™ of students to whom 

The list of scores, together wi XXIV of the authors’ 

<-* «*«. ,T und “Lndm, ot ,1m, folio,, 

is in no way dependent on referencetotheset^ ^ ^ 

We now continue the fiction -y PP rame t e r m by random 
separately seek information a ° U draws randomly a sample of 1, 
sampling. Each of these pe P compu tes the sample mean 

a sample of 5, and a sample of 25, ana con p 

X for each sample. i s hows the distribution 

In Table 13.1, the column headed N - ^ obviously , 

of the means of 20 random samp es v£due 0 f tbe 0 bserva- 

the mean of a sample of 1 case is m of tb j s column would 

tion itself. The mean and standard ^ but tQ differ some- 

therefore be expected to a PP roxl ™ J* Ung errors. You should 
what from them because of ran ,7 cver> a n attempt to esti- 
note that this is what happened. individual would expose 

mate m from information about a sing While the mean of 

the investigator to risk of a very ar 195 a. pag e iso. 

* H. M. Walker and J. Lev, Statical Inference, Henry 
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TABLE 13.1 Distribution of Means of 20 Samples Drawn from a Normal 
Population with fx = 121.6 and a = 37.15 when N = 1, 

N = 5,N = 25 


Score interval 


Number of samples 

N = 1 

II . 

O' 

II 

8 

165-169 

2 


160-164 

1 


155-159 

1 


150-154 

3 


145-149 

1 

1 

140-144 

2 

2 

135-139 


1 

130-134 

1 

2 1 

125-129 


2 5 

120-124 

1 

1 6 

115-119 


2 5 

110-114 

1 

1 3 

105-109 

1 

4 

100-104 

2 

2 

95-99 



90-94 



85-S9 


1 

80-84 


1 

75-79 



70-74 

1 


65-69 

1 


60-64 



55-59 

1 


50-54 



45-49 



40-44 



35-39 

1 



Total 

20 

20 

20 

* Mean of 20 means 

122.5 

119.0 

121.7 

* Standard deviation 




of 20 means 

39.5 

18.0 

5.9 


* Computed from ungrouped values 

the column for A = 1 comes close to /x, one sample had a value 
of only 36 and one had a value of 169. The range of estimates 
of n based on these 20 samples was therefore 169 - 36 = 133. 

Now examine the column headed N = 5. The observed means 
of the 20 samples recorded here have considerably less variability 
than those in the column N = 1, the lowest being 84.1 and the 
highest, 144.2, so that the range is 144.2 - 84.1 = 59.8. Obvi¬ 
ously, the error made in estimating /x from 1 sample of 5 cases is 
likely to be numerically smaller than the error made in estimating 



211 

SAMPLING DISTRIBUTIONS 

a from a single case. However, a person might happen to draw a 
single case with score almost exactly equal to m and then mig it 
happen to draw a sample of 5 cases with a fairly extreme value of A. 

Look now at the column headed N - 2o Here the sma 
mean in the 20 samples was 112.9 and the largest was 128.0, so 

that the range was only 128.0 — 112.9 lo.l. 

The three distributions in Table 13.1 suggest certain generahza- 

c ohrtiit the validity of an estimate of m made from a sample 
value of X. These generalizations can be obtained by mathemati¬ 
cal reasoning and are supported by many empirical studies. They 
app^y only on the assumption that sampling is random. These 

generalizations are: 

(1) The value of X obtained from a sample can be expected to 

differ somewhat from fx. arrnr ~ V - a are 

(2) In computations from many samples, the errors X M a 

(2) both positive and negative and the average error tends 
toward zero. In other words, the average value of X 
in many samples tends toward M , regardless of sample 

(31 As'sample size increases, the variability of the distribution 

which are: N ** l A = 5 N = 2o 

133 59.8 15.1 

Range of X 39 5 18.0 5.9 

Standard deviation of X individua l observations in 

Sampling Distributions. deviation similar to those 

Table 13.1 had a mean and a stand d d continued 

of the parent population, M “d -mp^g ^ ^ tQ 

indefinitely, that mean * close)y . The 20 means of samples 

approach p and cr more a ^ mean of the parent 

of 5 cases also had a mean (119.0 " ‘ draw „, the mean 

population, and if and more closely, 

of their means would tend PP^ ^ the samples of 25 cases 

A similar statement can be standard deviation of 

However for samples larger than ne the ta deviation , ,, 

the set of means was smaller than the number 

of the parent population, and would remain 

of samples were vastly increased. 
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The mean actually has a distribution of its own which can be 
ascertained if the distribution of the parent population is known. 
In the previous paragraph we have noted that the variability of 
the distribution of the mean appears to depend on sample size, 
and that relationship will presently be made explicit. 

For the samples used to obtain Table 13.1, many other statistics 
might have been computed and recorded. Thus, a distribution of 
medians might have been obtained, a distribution of variances, a 
distribution of ranges, a distribution of standard deviations, etc. 
In referring to such distributions, it is customary to refer to the 
statistic in the singular. Thus, the “distribution of the means 
of indefinitely many random samples of size N” is called “the 
distribution of the mean,” just as the distribution of a set of 
heights is called the distribution of height rather than heights. 
Every statistic has a distribution which depends upon the dis¬ 
tribution of the parent population and on sample size, but the 
distributions of different statistics are not the same. The distri¬ 
bution of a statistic is called the sampling distribution of that statistic. 

A sampling distribution is itself a population. It differs from 
the parent population by the fact that its elements are values of a 
statistic rather than single observations. The individuals on which 
a sampling distribution is based are samples rather than individuals 
of the parent population. A sampling distribution has parameters 
which are related to the parameters of the population of observa¬ 
tions from which the sampling distribution is derived and to the 
number of cases in the samples. 

The Normal Distribution for Statistics. When the parent dis¬ 
tribution is normal, the distribution of the mean is also normal. 
When the parent distribution is not normal, the sampling distribu¬ 
tion of means is not normal for small samples but becomes very 
nearly normal as sample size increases. For many (but not for all) 
statistics the sampling distribution is approximately normal for 
large samples. For this reason the normal distribution is very 
important in statistical inference. However, for many purposes 
the normal distribution is insufficient for the needs of statistical 
inference, and several very important non-normal distributions 
are used extensively in more advanced work. Even the elementary 
problems considered in this chapter will require the use of certain 
non-normal distributions. 

When the sampling distribution of a statistic is normal, or 
approximately normal, the mean of that sampling distribution is 
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either a parameter of the parent population or an expression 
mathematically derivable from the parameters of that population. 
The same is true of the standard deviation of the distribution of 

the statistic. . ....... 

The Standard Error. The standard donation of the distribution 

of a statistic is called the standard error of the statistic. The standard 

error is especially important in dealing with statistics which are 

normally distributed. For such statistics the standard error 

provides information concerning the probability that a statistic 

will deviate from its parameter by a specified amount, the 

standard error of the mean will be denoted and is given by 

the formula 


(13.1) 


(7 v = 


VN 


To illustrate the meaning of tr* let us look again at the data of 
Table 13.1. Since the standard deviation of the parent popu a urn 
was known to be <r = 37.15, we can compute cr* or each of the 
three sample sizes and compare that value with the obsenc< 

> t / v . . 1 _ .-m ♦ I'll.' • 




Observed standard 

Size of sample 


deviation 

N = 1 

37 15 - 37.15 

Vi 

39.5 

iV = 5 

3710 - 1G.6 
\/5 

18.0 

tO 

II 

3715 = 7.4 
V25 

5.9 


Even for as few as 20 samples, the correspondence bctweenUie 

expected value ay and the observe va ue s • distributions 
see what we might have predicted about the Uw*« ^ 

of Table 13.1 from knowledge of the h> t 1 pu i a(i on 

a - 37.15 for the parent P»P of P thc samp l e mean 

was approximately norma the d ^ perccnt of lhc 

may be assumed to be normal. , 

area under a normal curve lies between -IM° 

Therefore, if a great many samples are dra^^ ^ ^ ^ ^ 
population, we should expect about . ■ P us 

means lying between „ IS**, values, 

compare the observed distributions wit 
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Size of 

Value of 

Range from 

Observed means 
outside range 

sample 

1.9G<r f 

/jl — 1.96<r; to n + 1.960’; 

Number Percent 

AT =T 

72.8 

48.8 to 194.4 

1 

5 

N = 5 

32.5 

89.1 to 154.1 

2 

10 

N = 25 

14.5 

107.1 to 136.1 

0 

0 


EXERCISE 13.2 


Assume that random samples are drawn from a normal population 
for which /x and tr have the values shown below. For samples of the 
size indicated, what would be the mean and the standard error of the 
sampling distribution of X? Within what range would the middle 



95 percent 

of the 

sample means fall? 

The middle 50 percent? 






Range of 

Range of 


Population 


Distribution of X 

middle 95 

middle 50 


parameters 

N 

Standard 

percent of 

percent of 



a 


Mean error 

sample means 

sample means 

1 . 

60 

12 

9 

60 4 

52.2 to 67.8 

57.3 to 62.7 

2. 

60 

12 

64 




3. 

40 

12 

36 




4. 

40 

18 

36 




5. 

92 

8 

100 




6. 

53 

25 

100 




7. 

39 

15 

900 




8. 

39 

15 

144 





Estimation by a Single Value. In a previous paragraph of this 
chapter, we indicated two extreme views toward statistical in¬ 
ference, one that a sample provides an exact reproduction of the 
population, and the other that a sample provides little or no 
information about a population. 

We have now reached a stage where we can discuss this 
problem somewhat rationally. We have found that the mean 
of a random sample is likely to be not far from the population 
mean, provided the sample is not too small. On this basis we feel 
justified in calling the sample mean an estimate of the population 
mean. We have used here the expression “estimation by a single 
value” in order to distinguish it from estimation by an interval 
which we shall describe later on. An estimate by a single value 
is also called a point cstunaic, because it represents a point on a 
scale of possible estimates. 

The reasoning by which we have justified using X as an es¬ 
timate of /x can also be used to justify using s as an estimate 
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of <s using a sample percentile as an estimate of a population 
percentile, and using other statistics as estimates of other param- 

ete Some questions about point (single value) estimates remain to 
be answered. We note, for example, that for a norma popu^ n 
„ is n ot only the mean but is also the median. One may, theretore 

raise the question as to the possibility of using the ample 
as a basis for estimating m for a normal populate. Ano h 
question of some interest is the proper estimate for <r . Shall 

be the estimate which is used in tMs t6Xt ’ “' e 

timate used by most of the older books and by a few 

° £ tU 6 “for choosing estimates will now be discussed 

an 

Thus, no matter what the size of a sample may ^ normal 

unbiased estimate of the pop , biased estimate of the 

population, the — uiednm^ mi unlua^e^ ^ ^ a 

* re ,h * f , 
2(X - A) n-i vps .in unbiased estimate of o , 

The formula, s 2 = JfZT\ b 

. 'ZjX - j oes no t. This is the 

but the alternative formula, iV 

\ „f N — 1 rather than A in the de- 

chief justification for the us ^ grcat man y samples of 

nominator. For example, PP population in which 

9 cases each were drawn from nor^ ^ jY _ , . s 

<r 2 = 144. If for each sample • e ^ varia nce» would 

as the denominator, the mea ^ However, if A r = » wcrc 

be approximately equal to * sam ple variances would 

used as the denominator, the mean of the sample 

a-(N - 1) = 8(144) = 128 instead of 144. 
be approximately tt 9 , „ 

,3, P ««n.. - jsfrrsrs: 

medians for 20 samples o t j iat the means of the 

computed for the same samp es. i y t be same and are 

two distributions of statistics arc ^ lnean 

close to This illustrates the s ^^d Uimate of „ for a 
and the sample median are both an unb.ascu 

normal population. 
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Efficient Estimate. An interesting feature of the two dis¬ 
tributions in Table 13.2 is that the medians are much more 
variable than the means. This is a reflection of the fact that the 
standard error of the median is greater than the standard error 
of the mean. 


TABLE 13.2 Distributions of Means and Medians of Random Samples 
of Five from a Normal Population with p = 121.6 and 

a = 37.15 


Score interval 

Number of 
sample means 

Number of 
sample medians 

160-164 


1 

155-159 


1 

150-154 


1 

145-149 

1 

1 

140-144 

2 

1 

135-139 

1 

1 

130-134 

2 


125-129 

2 

2 

120-124 

1 

2 

115-119 

2 

1 

110-114 

1 

2 

105-109 

4 

2 

100-104 

2 

1 

95-99 

90-94 


2 

85-89 

1 

1 

80-84 

1 


75-79 

70-74 


1 

Mean of distribution 

119.0 

119.5 

Standard deviation 

of distribution 

18.0 

25.3 


Because of this difference in standard errors, the mean of a 
sample is likely to be nearer to n than its median, or, in other 
words, the error in using the sample mean as an estimate of n is 
likely to be less than the error in using the sample median for this 
purpose. 

Since the standard error of the mean is less than the standard 
error of the median, the sample mean is said to be a more efficient 
statistic for estimating n in a normal population. When two 
statistics are available for estimating the same parameter, the 
one with the lesser standard error is the more efficient. 

It should be pointed out that in a population in which the mean 
and median are presumed to differ, the sample median should be 


STANDARD SCORE FOR MEANS 

used for estimating the population median, and the sample mean 
used for estimating the population mean. This conclusion is 

based on consideration of bias. 

The Standard Score for Means. In Chapter 7 the standard 
score for an observation was defined as 

X-X 

(13.2) * = —“ 

This expression is entirely in sample terms. 

In Chapter 12 the corresponding expression in terms of param¬ 
eters was given as 

X-n 

(13.3) z = —— 

We learned that, if X is normally distributed, the latter expression 

has the standard (or unit) normal distribution. 

Corresponding to formula (13.3) for observations, the standard 

score for means is 


x- 


(! 3 - 4 ) 2 - J^JVN) 

Regardless of the size of N, the standard score in formula (13 4) 
is a standard normal deviate if X is distributed normally with 
mean „ and standard deviation <r. Unless N >s wy smaH^ this 
standard score is distributed approximately normally even 
distribution of X is not normal. It may be helpful to the read 
to notice that for means based on 1 observation (X = 1) formula 

(13.4) would be identical with formula (13.3). , 

Because of the normality of distribution of the standard score 

for means, we can apply the table of the normal distribution 
answer questions like the following. 

(1) Assume that we are drawing samples o size ^ 

population with m = 60 an ^ a = 8 ’ For w 1 a large? 

samples will X exceed 61 if the number of samples is very 

Solution: 

61-60 1 


z = 


= .625 


(8/V25) 18 

In a normal distribution, the area above the point * “ ' fi ^ “ 
.5 - .2340 = .2660. Hence X may be expected to exo^ m 

about 27 percent of samples. A better stateme 

probability .27 that X will exceed 61 ding uest i 0 ,., 

(2) Assuming the same situation as P 
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between what two limits will the values of X for the middle 
80 percent of samples be contained? 


Solution: 

The middle 80 percent of the area under a normal curve is 
contained between 2.10 = — 1.282 and z.w = + 1.282. For the 
sampling distribution of X, the corresponding points are obtained 

r . . , .. X - 60 , OQO , X - 60 

from the two equations , = - 1.282 and , 

The solutions of these equations for X are 57.95 and 62.05 which 
are the required limits, and we may say there is probability .80 
that X will not be less than 58 nor more than 62. 


= 1.282. 


The expression for the standard score for means (13.4) which 
uses the population value a is of great theoretical interest. Its 
direct application to data is limited to those situations where an 
estimate of a is available from earlier experience. In the most 
common situations, <j is not known and s must be used as a 
substitute. The standard score for means is then called t rather 
than 2 . 

(13.5) t = 

This t is not normally distributed, but for large samples its dis¬ 
tribution is sufficiently like the normal so that the tables of the 
normal distribution may be used. Questions which arise in this 
situation are how large a sample should be in order to make the 
normal approximation applicable and how to deal with problems 
for which it is not applicable. These questions will be taken up 
on page 221. 


(s/ VN) 


EXERCISE 13.3 

1. If random samples of size 36 are drawn from a normal population for 
which n = 57 and cr = 12, what proportion of such samples will have 
a mean 

(a) Greater than 57? 

(b) Greater than 60? 

(c) Smaller than 59? 

(d) Between 55 and 59? 

2. In the situation described in Question 1, what is the probability that 
the mean of a random sample will be 

(a) Greater than 53? 

(b) Between 54 and 60? 
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3 Suppose random samples of 64 cases are drawn from a normal population 
for which M = 90 and <r = 18. Consider the 5 percent of samples with 
means which deviate most widely from p. Outside what %alues will 

these means fall? 

Solution: For half of them be larg6r tha “ ^ and 


(i/s/N 

half it will be smaller than 2 . 026 . 

<T 18 


= 2.25 


If 


If 


y/N \/64 

zou = - 1.96 and 2.976 = 1-96 

X-90 
2.25 

X-90 
2.25 


= - 1.96, X = 90 - 1.96(2.25) = 85.6 
= 1.96, 7 = 90 + 1.96(2.25) = 94.4 


Therefore the most extreme 5 percent of the samples will have X either 

smaller than 85.6 or larger than 94.4. . . ... 

4. Suppose random samples are drawn from a normat populaUon or whm 
M = 75 and <r = 12. Between what two values will the middle 9J per 

cent of sample values of X fall if Nis (#) ^ 

(b) 16? (J) (f) 90 °' , . 

i „_i„ nf 900 cases is drawn from a population for 

s - sk : srtrSii”r—**»*« 

probability 95 that X will fall d » » (#) , Q? 

(b) 10? («*) 15? « 2!? 

The Confidence Interval. Estimation of a^ parameter by, a 

single number is valuable because t te earn ^ wil| 

close to the value of the para mete . ^ parameter, 

almost never be exactly the same aslie va^ ^ oint cstilnati „n 

however, it is somewhat um>at • * >• t |, e pa ram- 

does not sufficiently tie down our informatior 

Another type of estimate is 

same sample two statistics and using ametcr . These two 

statistics as an interval estimate ‘ interval between 

statistics are computed in such a way that 

them usually contains the parameter Bj “■ > interva , 

that for some preassigned percentage of Qther 

between the computed values con ai t con i a in the 

words, /or any given sample the interval may or may not 
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'parameter , but the computation of the interval is of such nature that 
for some preassigned percent of samples the interval contains the 
parameter. An interval obtained in this way is called an interval 
estimate or a confidence interval; the two statistics which determine 
the limits of the interval are called confidence limits; and the pre¬ 
assigned percentage of samples is called a confidence coefficient. 
These concepts will be clarified and made more precise when we 
develop them for the problem of obtaining a confidence interval 
for p. 

Confidence Interval for the Mean when the Sample Is Large. 
The method of obtaining this interval for a sample of 50 or more 
cases will first be described, and then a justification will be given 
for the method. 

Suppose we have drawn a random sample of 125 cases and we 
wish to compute a confidence interval for p with confidence 
coefficient .99. In other words, we wish to compute a confidence 
interval of such nature that in 99 percent of similar computations 
the interval will contain the parameter. We compute X and s 
from the sample, and find z . 9fl5 and z.m from the table of the normal 
distribution. From these values we compute the two statistics, 

X + z 995 —p= and X + z .006 —p=' These are the two confidence 
v 125 V125 

limits. The interval between them is the confidence interval. 

Suppose, in the above example, X = 93.5 and s = 14.2. In 
Appendix Table II of the normal distribution, we find z . B 95 = 2.576 
and Z. 00 & = - 2.576. Then the limits are 

93.5 - 2.576 

V\/l25 


J = 93.5 - 3.3 = 90.2 


and 


93.5 + 2.576 = 93.5 + 3.3 = 96.8. 


Consequently the interval between 90.2 and 96.8 is the required 
confidence interval, and we state, with confidence coefficient .99, 
that 90.2 < p < 96.8. This is read “ p is greater than 90.2 and 
less than 96.8” or “p is between 90.2 and 96.8.” 

In more general terms, suppose the sample size N is sufficiently 
large to permit use of the normal approximation, and the confidence 
coefficient is c, then confidence limits for p are 


X + Zi-e 




and X + Z\+ C r - 

2 v N 


(13.6) 



student’s distribution 
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The following justification for the limits in (13.6) is somewhat 
mathematical and may be skipped by a reader who wishes to do 
so. We know from the discussion of the tables of the normal 
distribution that the percentage of samples between z\-e and 


Zl+c is 


1 + c 1 - c 


= c. 


means, the percentage of samples for which 


In terms of the standard score for 

T? __ 

—-is between 

(s/VX) 

Zi—e and zi+c is c. The statement can be put mathematically by 
2 2 

saying that the inequality 

(13 - 7) < wiffi K 

is true for a proportion c of samples. By simple algebra, inequality 
(13.7) can be transformed into inequality, 


(13.8) 


X + zi-e— t= < M < X + * 1 + 577 =? 

TVN * v * 


and this latter inequality is true whenever the former is true. 
Therefore c is also the proportion of samples for which inequality 

(13.8) is true. , , .... 

It is important for the reader to realize that the probability 

expressed by (13.8) is about the intervals containing p. It is not 
a statement about possible value of /z in the given intcr\a . n 
other words, the inequality (13.8) might mislead a reader to think 
that there is a collection of populations, each with some \a uc 
of n and that a proportion c of these populations has values of m 
in the interval indicated by the inequality. This is an lruorrec 
interpretation of the inequality. The correct interpret ion 
that there is one population with a unique m- There are, owever, 
as many possible intervals as there are possible samples, borne 
of these intervals satisfy the inequality because t k> ac 11,1 ^ 
contain jn between their limits. Some do not. > ie 
stated, the proportion of intervals which satisfy (13.S) is c. 

Student’s Distribution. We have stated above that, i the 
population has a normal distribution, the samp c 

also have a normal distribution, and that if N is 


X- 


(< t/VN) 


„ . . . . , X - n distribution which 

sufficiently large the values l = ,iavc d 

approximates the normal. For small samples, say A 40 or 
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the normal approximation for t is inadequate, and its exact dis¬ 
tribution must be used. This distribution is called either “Stu¬ 
dent’s ” 8 distribution or the t distribution. It is a symmetrical 
distribution, with mean zero, but its form differs for different 

values of N. . . 

A difficulty in using Student’s distribution, one not met in 

using the normal distribution, is that we need a separate proba¬ 
bility table for each sample size. The situation is somewhat 
further complicated by the fact that the tables are not stated 
directly in terms of numbers of cases, but in terms of a related 
value called degrees of freedom which we shall denote by n. The 
relationship between number of cases and number of degrees of 
freedom differs from one type of problem to another, and must 
be ascertained separately for each type. The concept of degrees 
of freedom is discussed in the authors’ advanced book. A general 
understanding of that concept is not needed at this time. It is 

enough now to know that for t = the number of degrees 

of freedom is one less than the number of cases, or n = N - 1. 

Tables of areas for Student’s distribution are found in Ap¬ 
pendix Table V. Each row is really a separate table corresponding 
to the number of degrees of freedom indicated in the left-hand 
column. The reader will notice that the heading of this column 
is n, which is our customary notation for number of degrees of 
freedom. Tabular entries are values of t. In the column headed 
£. 90 , each entry is the 90th percentile of the distribution to which 
it belongs. A similar description applies to the other columns. 
Since the distributions are symmetrical with mean zero, it is not 
necessary to tabulate percentiles below the 50th. Instead we use 
the relationships = - t.w, £.oos = - £. 995 , etc. The entries in the 
bottom row are percentiles of the normal distribution. 

EXERCISE 13.4 

READING A TABLE OF STUDENT’S DISTRIBUTION 

1 . By examination of Appendix Table V, verify enough of the following' 
statements to be sure you are reading the table correctly. 

* “Student” was a well-known British statistician named William Sealy Gosset 
(1876-19-17) who was adviser to the Guinness brewery in Dublin. A ruling of that firm 
forbidding their employees to publish the results of research was relaxed to allow him to 
publish mathematical and statistical research under a pseudonym. Hi9 paper on “The 
Probable Error of a Mean,” published in Biometrika in 1908, which first called attention 
to the fact that the normal curve does not properly describe the distribution of the ratio 
of mean to standard error in small samples, is now a classic. 



EXERCISE 


223 


(a) The 95th percentile of Student’s distribution is 

(1) t = 2.35, when n = 3. 

(2) t = 1.94, when n = 6. 

(3) t = 1.74, when n = 17. 

(b) The 5th percentile of Student’s distribution is 

(1) t = - 2.35, when n = 3. 

(2) t = - 1.94, when n = 6. 

( 3 ) t = - 1.74, when n = 17. 

(c) For Student’s distribution with n = 5, 

(1) 10 percent of the area lies to the right of an ordinate at l = 1.48. 

(2) 25 percent of the area lies to the right of an ordinate at t = .73. 

(3) 1 percent of the area lies to the right of an ordinate at t = 3.30. 

(4) 80 percent of the area lies to the right of an ordinate at t = - .92. 

(5) 99.5 percent of the area lies to the right of an ordinate at 

t = ~ 4.03. 

(d) For Student’s distribution with n = 12, 

(1) 50 percent of the area lies between t = - .70 and t - .70. 

(2) 80 percent of the area lies between t = - 1.36 and t = 1.36. 

(3) 90 percent of the area lies between t = - 1.78 and t = 1.78. 

(4) 95 percent of the area lies between / = - 2.18 and i = 2.18. 

(5) 98 percent of the area lies between t = - 2.G8 and t = 2.68. 

(6) 99 percent of the area lies between t = - 3.05 and i = 3.0o. 

2. In the following questions you may assume that we are considering 



(b) 

(c) 


y ~ £ and that the number of degrees of freedom is n - N 1. 

If'# = 10, at what point on the scale of the t distribution would 

you draw an ordinate so that .. 

(1) 5 percent of the area would lie above it (i.e. to the right). 

(2) 10 percent of the area would lie below it (i.e. to the left). 

(3) 80 percent of the area would lie above it? 

(4) 99 percent of the area would lie above it. ? 

Answer the same four questions, if N = 10. . 

If at = 7 at what two points on the scale of the t distribution would 

you draw ordinates so that they would include between them 

(1) The middle half of the area? 

(2) The middle 80 percent of the area? 

(3) The middle 90 percent of the area? 

(4) The middle 95 percent of the area? 

(5) The middle 99 percent of the area? 


3. Answer the same five questions, if N 20. 

4 . What percent of samples of size N- 15 would have values of 


X- 




s/v'iV 

between -2.14 and 2.14? Does your answer depend on the value of m? 

Of s? Of A? 
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Confidence Interval for the Mean when the Sample Is Small. 
The reasoning which led to the confidence limits given in formula 
(13.6) for large samples leads also to the confidence limits for small 
samples, with the only modification being that z is replaced by t 
and that reference is made to the t distribution instead of to the 
normal. For a small sample, the confidence limits for /z can, 

therefore, be obtained as follows. . # 

Suppose that a random sample of size N is drawn from a 

normally distributed population, and that the confidence coefficient 

is to be c, then the confidence limits for /z are 

(13.9) J + and 1 + t '^'VN 

Values of h- c and ti±c are read from Appendix Table V with 

T 2 

In Table 13.3 is shown the confidence interval for each of the 
20 samples of 25 cases which have been previously discussed and 
for which the distribution of means was given in Table 13.1. We 


TABLE 13.3 Confidence Interval for /z with Confidence Coefficient .90, 
for Each of 20 Random Samples of 25 Cases from a Normal 
Population with /z = 121.6 and tr = 37.15 


Sample 

number 

Y 

s 

Lower limit 

Upper limit 

Includes 

M? 

1 

123.7 

36.9 

111.1 

136.3 

Yes 

2 

116.0 

31.9 

105.1 

126.9 

Yes 

3 

117.9 

25.6 

109.1 

126.7 

Yes 

4 

126.7 

35.9 

114.4 

139.0 

Yes 

5 

128.0 

41.4 

113.8 

142.2 

Yes 

mm 

6 

124.5 

38.1 

111.5 

137.5 

Yes 

mm 

7 

112.9 

38.7 

99.7 

126.1 

Yes 

mm «p 

8 

123.8 

34.3 

112.1 

135.5 

Yes 

mm <m 

9 

118.8 

39.0 

105.5 

132.1 

Yes 

10 

127.6 

39.7 

114.0 

141.2 

Yes 

11 

128.3 

37.0 

115.6 

141.0 

Yes 

12 

122.1 

41.7 

107.8 

136.4 

Yes 

mm 

13 

125.6 

36.0 

113.3 

137.9 

Yes 

14 

111.0 

30.1 

100.7 

121.3 

No 

15 

111.4 

39.6 

97.9 

124.9 

Yes 

16 

123.S 

31.8 

112.9 

134.7 

Yes 

17 

117.8 

37.2 

105.1 

130.5 

Yes 

18 

117.3 

37.8 

114.4 

130.2 

Yes 

19 

133.6 

34.7 

121.7 

145.5 

No 

20 

123.0 

41.4 

- 108.8 

137.2 

Yes 
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recall that these samples were drawn randomly from a population 
for which n and <r are known, which is, of course, a fictitious 

situation, because parameters are ordinarily not known 

Entries in the column at the extreme right show that 18 of the 
20 confidence intervals include „ = 121.6 and 2 do not. Inclusion 
of fi in the interval is indicated by “Yes and nonmclusion by 
“No” Here the proportion of samples (70 .90), in w ic m 

is included between the computed confidence intervals, is exactly 
equal to the confidence coefficient of .90. However one must 
not expect such perfect agreement of experience and theory every 
“Se an interval estimate is made. The value c - .90 applies to 
the totality of computations from a very large number of samples 
rather than to any fixed number of them. It is interesting 
note in Table 13.3 that, even when the confidence interval missed 

rhp true value of u, it missed it only slightly. 

It will be of interest to the reader to check some of the compu¬ 
tations in Table 13.3. We shall describe the first of these. 

As the confidence coefficient is stated to be c .00 

, 1+c 1.00 

= .05 and —= ~o 


1 - c .10 


= .05 


To use Student’s distribution, we first note that n = 25 1 - 24j 

rs. di sr'i- - 

sample 1 are 

36.0 

J + tM = 123.7 + (- 1.7D ^ 


= 111.1 


and 


36.0 


J + ln VR = 123 ' 7 + (1 ‘ 71) ^5 


= 136.3 


EXERCISE 13.5 

1. A sample of 16 cases has yielded X = 2&a^and 8 ^ 

Make an interval estimate for with confidence ^ = ^ 
(a) c = .99, (b)c-.95, 

2. A sample of 400 cases has 

Make an interval estimate fo M = (J q 

( a) c = .99, (b) c = .95, W 

3. A sample of 16 caseshas yielded X 

Make an interval estimate for n 
(a) c = .99, (h) c = 95 - 
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That aspect of statistical inference which deals with estimation 
of unknown parameters of a population was discussed in the 
preceding chapter. Of the three examples involving the use of 
samples described at the beginning of that chapter, the first two 
illustrate problems in which the main task is the estimation of 
parameters. The problem presented in the third example differs 
from those in the first two in that it requires a choice between 
two alternatives: (1) continued use of the medical treatment now 
in use, or (2) substitution of the new kind of treatment which is 
under consideration. These alternatives can be reduced to sta¬ 
tistical considerations if we say that the old method will be con¬ 
tinued if the new method does not cure a larger proportion of sick 
persons than the old. However, if the new method does produce 
a larger proportion of cures, then it will replace the old method. 

The approach to this problem is to formulate a hypothesis that 
some particular relation between the proportions of cures is true, 
and then to test the hypothesis experimentally. On the basis of 
the outcome of the experiment, a conclusion is drawn about the 
relative effectiveness of the two methods of treatment. 

Methods of statistical inference involving proportions will be 
described in the next chapter. In this chapter we shall develop 
methods required in dealing with hypotheses about means. Two 
applications will be discussed. 

A Problem Involving a Hypothesis about Means. Suppose 
the question has been raised as to whether people who are deaf 
and those with normal hearing differ in motor abilities. By now 
the student should have developed far enough in statistical think¬ 
ing to be unwilling to accept a generalization about this matter 
based only on a few personal experiences, “I know several deaf 

persons who . . .” He should also be unwilling to accept a 
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generalization based only on general theories of physiology. Al¬ 
though one should recognize that either abstract theorizing or 
personal experience may lead to a formulation of a useful though 
tentative generalization, an appeal to empirical data is essential 

to establish that generalization. , . , 

In a research reported by Long, 1 deaf and girls having normal 

hearing were compared on a number of motor abilities^ Inasmuc 

as motor ability increases with age, if one group had been older 

than the other, a difference might erroneous* 
terpreted as related to deafness. For fear that the motor traits 
studied might be related to intelligence or that perfomance m gh 
be affected by the ability to concentrate and follow directions, it 
Ssotemed important to control intelligence, ^cre are sev 
ways to arrange an experiment for the purpose of controlling the 
effect of background traits which might bias the outcome^ L g 
paired each deaf girl with a girl having normal hearmg of the 
same age and intelligence quotient. Such pairing is usually 
laborious and not always the most efficient device but h is very 
freauentlv employed. The data which Long reported on 37 pairs 

discussion, only the first ten pairs have been used. Lo g P 

sented findings on several traits of which only balance '' ill e 

renorted here The method of obtaining a score for each child 
reported nere. . . fn _ the research but extraneous 

on a test of balance is important tor tne re.ea 

to ^ 
that of her hearing mate, leaving a differ* :score ^.,ch may-be 

denoted X. It should be recognised that X a measure J 

enctfaS and"ta“u£ce. Terences related1 tc. train¬ 
ing or experience will presumably be eliminated 
measurement those tasks the c .. control sources 

KSXT£2rc^hich niight t ou £ bias the outcome 

must be controlled. Differences related to m iv « P 
will always remain. Deaf children arc not uniform 

■ John A. Long, Mo,or Mi,Hi. of Deo, OMron, C °"™ 

"<;»», **** ln,ercnc *■ Hcnry Ho11 ' 

page 382. 
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ability, nor are children with normal hearing. Even if deaf and 
hearing children are generally equal in balance, very few pairs 
will have zero difference. Sometimes a deaf child whose balance 
is better than average is mated with a hearing child whose balance 
is poorer than average, producing a negative difference score. 
Sometimes the accidents of pairing will produce a positive dif¬ 
ference. The population of such differences may be assumed to 
have a normal distribution with mean p. If there is no difference 
between deaf and hearing in general, p = 0. If hearing children 
have better balance than deaf, p > 0, although for many pairs 
the difference X may be negative. If hearing children have 
poorer balance than deaf, p < 0, although for many pairs the 
difference X may be positive. 

A Statistical Hypothesis. A hypothesis about p is a tentative 
statement that p has some particular value or that /x has some 
one of a set of values. After it is formulated, the hypothesis is 
subjected to an experimental check known as a test of the hy¬ 
pothesis. 

Suppose your knowledge of physiology inclines you to believe 
that p is greater than zero; however, you are not inclined to guess 
how much greater. You might set up the statistical hypothesis 
/x = 0 and test it against the alternative p > 0. If then the 
statistical hypothesis p = 0 is found to be unacceptable, you 
would have grounds for asserting your original belief; if the 
statistical hypothesis is found to be acceptable, you would be 
warned that you could not safely assert your belief that /x > 0. 

Suppose that a researcher is unaware of the connection be¬ 
tween balance and the semicircular canals in the ears, and has no 
prior opinion as to the value of p. He merely wants to find out 
whether any general difference exists. He will then set up the 
hypothesis p = 0 and test it against the alternative “ p is not 0.” 

The statistical hypothesis tested is often called the null 
hypothesis. A hypothesis is always a hypothesis about a parameter 
(or sometimes about more than one parameter) of a population, 
never about a statistic of a sample. It is stated in the present 
tense, that some general state of affairs is now existent. It should 
not be stated in future tense (“the mean will be zero,” “the 
correlation will be positive” etc.), because that phraseology sug¬ 
gests a kind of fortune telling about the outcome of a particular 
sample. 

Test of Hypothesis that p = 0. To test this hypothesis a ran¬ 
dom sample must be drawn; in the present problem that will be a 
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TABLE 14 1 Scores on Balance for 10 Pairs of Deaf and Hearing 

Girls * 


Pair 

Number 


Balance score for 
Hearing Deaf 


X 

Hearing-deaf 


1 

2 

3 

4 

5 

6 

7 

8 
9 
10 


2.3 

1.0 

3.7 

3.3 
10.0 

2.7 

8.3 

0.0 

4.3 

7.7 


Total 

X 

s 


49.3 


2.0 

2.0 

2.7 

2.7 

3.0 

2.7 

3.7 

1.3 
2.0 

4.3 

26.4 


0.3 

1.0 

1.0 

0.6 

7.0 

0 

4.6 

4.7 

2.3 

3.4 


8 


/Vat 


22.9 

2.29 

2.56 

0.810 


* Source of data: Long, op. cit. 

sample of matched pairs. The data for 10 such Pai^iU be found 
in Table 14 1 Each pair-difference has been denoted A and 
Itistics X = 2.29, a - 2.56, and s, VN = 0.810 have been com- 

puted. The standard score for means is 

X - M 2.29 - 0 


t = 


t /y/N 


0.810 


2.83 


that, if the hypothesis M 0 • - with the largest values 

sample which is in the l percent f camnle is ex- 

of t We are inclined, therefore to say ^ s 

ceptional, or unusual, for samples from a population 

M = , n nlv T s and N are known, and from 

Now „ » not known. 0n >; * ctcd about the reasonable- 

these statistics evidence must ^ that hypo thesi.s leads to a 

ness of the hypothesis, n 0 ran dom sample, it 

value of ( that would occur onl^ rarety. ( Hcnce we 

does not seem a reasonable hyp we , mve made this de- 

reject the hypothesis that /* • , . i 

cision, our test of the hypothesis's to • should be rc- 

. The reader may feel that ^ ^ is obtaine d in a 

jected whenever a value of X 
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sample. Such a policy would be mistaken because it fails to take 
account of sampling variability. Even if /z is actually zero in 
the population from which samples are drawn, the sample values 
of X will vary so that almost none of them will be precisely zero; 
most of them will deviate from zero by a small amount in one 
direction or the other, and only a few will show large deviations 
from zero. Consequently, values of X near zero do not throw 

suspicion on the hypothesis, n = 0. 

In a general sense (this point will be clarified in further dis¬ 
cussion) , one tends to accept a hypothesis about /z whenever the 
value of X computed from a sample is close to the value specified 
by the hypothesis, and to reject that hypothesis whenever the 
value of X is far from the specified value. In either accepting or 
rejecting a hypothesis, an investigator may make the wisest 
decision possible in view of the data and still be in error. How¬ 
ever, the nature of the procedures which will be now described 
are such that at least the probability of making large errors is 
kept at a low level. 

Level of Significance. To this point the word “exceptional” 
applied to the size of X has been treated rather vaguely. Actually, 
the decision as to what is to be regarded as exceptional is arbitrary. 
For a small sample like the one we have been considering, it is 
customary to regard the 5 percent of samples having the most 
extreme values of X as exceptional and to reject the null hy¬ 
pothesis when one of these samples has been drawn. This arbi¬ 
trary value (here selected as 5 percent) is called a level of signifi¬ 
cance, because it sets apart certain differences between sample 
mean and hypothetical population mean as being sufficiently 
significant to lead to rejection of the hypothesis. 

Having decided upon a level of significance, it is still necessary 
to decide how to use it. Shall we apply it to the right tail of the 
distribution and reject the hypothesis \x = 0 only when X is 
exceptionally large, or to the left and reject only when X is 
exceptionally small, or shall we regard both extremely large and 
extremely small values of X as exceptional? 

The answer to these questions depends upon what alternatives 
seem particularly important to guard against in view of the 
problem situation. If the investigator has started out with no 
opinion concerning the relative balance scores of the deaf or hearing 
children, he will wish to reject the hypothesis that ju = 0 should 
he find a value of t that is numerically very large, whether it is 
positive or negative. In other words, the alternative he will wish 
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to consider is simply: m is different from zero. Suppose he has 
selected .05 as the level of significance at which he will work. He 
will then decide to reject the hypothesis, n = 0, if he obtains a 
sample for which t exceeds or a sample for which t is less 
than t. m . The decision then is to reject the hypothesis n — 0 if 

a sample of 10 cases shows a value of t = which is smaller 

than -2.26 or larger than 2.26. This set of values of 1 is called 
the region of rejection, or the region of significance, or the critical 
region. The probability .05 corresponding to that region is the 
level of significance. The test we have just used is called a too- 
sided test or a two-tailed test because the region o rejection is on 
both sides of the probability curve (or in both tails of the cure). 

One-sided Test. Now if the investigator has enough prior 
information to entertain a strong belief that the mean balance 
score for hearing children exceeds that for deaf children, he m. y 
wlh to reject the hypothesis, „ - 0, should h.s sample show a 
large positive value of t. He will not consider it important, how¬ 
ever, to reject the hypothesis if a large negative value appears in 

the sample. (You will recall that, in Table 141 ' X ‘ s P °f 
for a pair if the score of the hearing child exceeds that of the 
deaf child ) Therefore, his region of rejection will consist of those 
values of 1 larger than = 1.83. This region is a one sided o 
one-tailed region, and the test is a one-sided test. Note hat 
level of significance is .05 in both situations, but that the location 

of the critical region differs. . . i hvDothesis 

The steps in formulation and testing a ^atis ea htpo he^ 

are stated in Table 14.2, first in generaterm; and then in term 

specific to the problem of comparing deaf and hearing persons 

in regard to balance. p,,: r : n(T pn-sps as 

Comparing the Means of Two| ®P U a y ‘ 'i al)orious an d time 
described for deaf and hearing child ■ - for whic h n0 

consuming and often when the investigator 

mates can be found. It is emp populations he 

fears that a random sample from each oUh^t 

wishes to compare might differ 1 P r ison. For- 

variable which would introduce bias in ■ j an( j i s 

tunately, in many problems this 
satisfactory merely to draw a separa 

population. .. - w whether sophomore men 

Suppose, for example, the ques , wit)l contemporary 

in a certain university are more familiar 
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TABLE 14.2 Steps in Formulating and Testing a Statistical Hypothesis 


Step 

General procedure 

Application to comparison of 
deaf with hearing 

1 

Select the measures on which 
the investigation will be based. 

Choose as a measure the balance 
score of a hearing girl minus the 
balance score of the deaf girl with 
whom she is paired. 

2 

Specify the general nature of 
the population and the parameter 
or parameters needed for the in¬ 
vestigation. 

The population of differences can 
be assumed to be approximately 
normal, and the parameter in ques¬ 
tion is its mean p. 

3 

Formulate a hypothesis about 
the population and decide on the 
alternatives. 

The hypothesis is /x = 0 and the 
alternatives are p is not equal to 0 . 

4 

Determine a statistic by which 
the hypothesis is to be tested. 

II 

• 

5 

Ascertain the distribution of the 
statistic. 

Student’s distribution with 
n = N — 1 degrees of freedom. 

6 ! 

Choose a level of significance. 

5 percent. 

7 

Determine the region of rejec¬ 
tion on the basis of the level of 
significance and the alternatives to 
the hypothesis. 

Reject p = 0 when t is larger 
than t.vn or is smaller than t. ok. 

8 

Draw a random sample of size N 
from the population. 

Determine differences in balance 
for a random sample of matched 
pairs. 

9 

Compute for this sample the 
value of the previously specified 
statistic. 

Compute t from the sample m 
Table 14.1 obtaining t = 2.83. 

10 

Determine whether the com¬ 
puted value of the statistic is in the 
region of rejection. 

Ascertain that Ms in the region 
of rejection because /.975 = 2.26 and 
the sample value t = 2.83 is larger 
than /. 97 s. 

11 

Reject the hypothesis if the 
value of the statistic is in the region 
of rejection; otherwise accept it. 

Reject the hypothesis p = 0. 


science than sophomore women. A random sample of men is 
drawn from the sophomore class and given the Science Section 
of the Contemporary Affairs Test of the Cooperative Test Service. 
By this device what may be presumed to be a random sample 
of science scores of sophomore men is obtained. Let us use N i, 
3Ti, and si to denote the number of cases, the mean and the 
standard deviation of this sample. 

A random sample of sophomore women is also drawn and given 
the same test. Let us use N z , X 2 , and s 2 to denote the corre¬ 
sponding statistics of this sample. 

If pi represents the mean of the population of science scores 



COMPARING THE MEANS OF TWO POPULATIONS 233 

achieved by sophomore men and Ms that of science scores of sopho¬ 
more women, the hypothesis to be tested is m. - Ms = 0. 

The statistic by means of which this hypothesis is to be tested 
is the standard score for the mean difference X! - X s. lo obtain 
this standard score, the standard error of the mean difference is 
required. Advanced texts give several different formulas for this 
standard error, applicable under different experimental conditions. 
The one given in formula (14.1) can be used only if A, and V, 
are fairly large, for example, neither one less than 25. If t e 
standard error of X, - X* is denoted sv,-.r„ then for large values 

of N 1 and N 2f 


(14.1) 




s/ 


S? 


_ + A 

Ni N t 


and the standard score for the mean difference is 

(Yi - %V) ~ (mi - M2) 

(u * 2) z= 

y Ni n. 

This standard score has been denoted a rather thanJ because it is 
alwavs referred to tables of the normal distribution. If \alues 
of N> and N , are so small that the investigator feels he should use 
Student’s distribution, then he needs an entirely different formula 

and should consult a more advanced text. , 

Aside from the fact that the statistic of formula (14.2) 1 - u 

of that of formula <13.5) .ltd that w. ™ 

pondcntly to m*. ”” “g*., fW 

the test of the hypothesis Mi “ H ~ u proccc y 

which were outlined in Table 14.2. i ovp i oi 

Suppose that a two-sided test and a significance level of 01 

have been chosen, and the pertinent sample data are found as 
follows: 


N 


8 2 


s*/N 


Men 

120 

7.57 

23.05 

.1921 


Women 

130 

6.40 

21.49 

.1053 


Then 


t/ 


il + if = v^3574 
A, As 


= .598 


(7.57 - 6.40) — 0 = ! 90 
.598 


and 


z = 
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The region of rejection is obtained from the normal distribution 
and consists of those values of z larger than z .995 = 2.576 or smaller 

than z .005 = -2.576. . 

Clearly, the obtained value z = 1.96 does not fall m the region 

of rejection. Consequently, the hypothesis that m - fit = 0 must 

be considered tenable, reasonable, and not contradicted by the 

data. . . 

A word of caution is needed at this point. To retam a hy¬ 
pothesis does not prove it true but merely indicates that it is not 
inconsistent with the observed data of a sample. Similarly, to 
reject a hypothesis does not prove it false but merely indicates 
that the sample data obtained are improbable if the hypothesis is 
true. Furthermore, if the hypothesis is not rejected, it is inap¬ 
propriate for the investigator to discuss why one sample had a 
higher mean than the other. The failure to reject the hypothesis 
Hi - /x 2 = 0 means simply that the difference in sample means 
may well have arisen through accidents of sampling from popula¬ 
tions with the same mean. It is wrong to say, “There was a 
tendency for men to have the higher mean but the difference was 
not significant.” Failure to reject the null hypothesis means that 
there is no convincing evidence of a tendency for the scores of 
men and women to be different. 

EXERCISE 14.1 

1. The same 10 pairs of deaf and hearing girls whose balance scores were 
presented in Table 14.1 had scores on strength of grip as follows: 

Pair 123456789 10 

Hearing 16 12 17 18 15 21 16 22 18 16 

Deaf 10 14 12 21 19 12 19 17 19 25 

Test the hypothesis that deaf and hearing do not differ in respect to 
strength of grip, the alternative being that they do differ, and .05 the 
level of significance. 

2. For all 37 pairs in Long’s study, 2X = 87.1 and 2X J = 363.17, where 
X is the balance score of a hearing child minus the balance score of 
her deaf mate. Test the hypothesis n = 0, the alternative being /x > 0, 
and .01, the level of significance. 

3. For all 37 pairs, the corresponding data for strength of grip are 2X = 86 
and 2A 2 = 2072. Test the null hypothesis, fi = 0, when the alternative 
is /x > 0 and .01 the level of significance. 

4. For all 37 pairs, the corresponding data for rate of tapping with best 
hand are 2X = 5.9 and 2X 2 = 982.55. Test the null hypothesis, ju = 0, 
the alternative being /x ^ 0 and .02 the level of significance. 



235 


EFFECTIVENESS OF A TEST OF HYPOTHESIS 

5. For all 37 pairs, the corresponding data for rate of tapping with right 
hand are 2X = 28.8 and 2X’ = 1595.52. Test the null hypothesis 
l = 0 against the alternative m * 0, using the .05 level of significance. 

Effectiveness of a Test of Hypothesis. In this section the 
various problems involved in testing a hypothesis such as choice 
of level of significance and choice of region of rejection will be 
reconsidered from the point of view of the relation of the hypothesis 

t0 The readtr^l recall that the decision to reject or to accept a 
hvDothesis involves a degree of arbitrariness. There is always a 
risk that a hypothesis may be rejected when it is true or accepte 
when it is false. The error in rejecting a hypothesis when it is 

true is known as an error of the first kind, or a >pe 

The error of accepting a hypothesis when it is false is known as 

an error of the second kind, or a Type err ° r ' , l ia bilitv 0 f 
What is called the level of significance is the probability of 

rejecting a hypothesis when it is, in fact, true^ J" ' ; ^“ g ™£ 
during the course of a great many experiments . > * n hi he * 
testing a true hypothesis, an investigator who » ‘ 

05 level of significance will falsely reject 5 percent of tho.e true 
hypotheses. !f he is employing the .01 level efface 
ill falsely reject 1 percent of the true hypo|theses he ests.. JThe 

level of significance is, therefore, the pro la i . 

Naturally one would like to make the protabrfrty oi z T pe 

*• >«• “ — ; iv'T, Zt .«>■ 

significance as low as possible. However the possi- 

because the choice of this leve i» ar 1 ™ r ; limited b ’ y t j ie ne ed 

bility of reducing the level of signi To gee this need, 

for rejecting the hypothesis when * so ^ there is n0 

suppose the level of s, g nlfica “ ' j { \ his is done , however, there 
possibility of an error of Type • u cveu when it is false, 

is no possibility of rejecting t yP , e the essence of 

There is then no point to the study at as a 

hypothesis testing is the utilization o ex hypothesis and its 

basis for a choice between the acceptance of a h> pothex.x 

^addition to the choice of 

*5 ^^ in one tail of the sampling 

distribution, or the other, or m bot tai• location of 

The choice of level of stance and the cho,^ ^ 

the region of significance (whi 
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can be evaluated in terms of the need for rejection of the hypothe¬ 
sis when it is false. 

The problems just posed can be approached directly in terms 
of the need for reducing the probability of a Type II error; that 
is, of reducing the probability of accepting a hypothesis when it 
is false. In the following discussion we approach the matter from 
the mathematically equivalent view of increasing the probability 
of rejecting the hypothesis when it is false. 

The Power of a Test. This discussion will involve comparison 
of statistical tests. It will be necessary to define the sense in 
which one test may be said to be better than another. To make 
this discussion clear, it will be necessary to state explicitly what is 
meant by a test of hypothesis. For the purpose of the following 
discussion, a test is defined when the following have been specified: 

(1) A statistic, 

(2) Sample size, 

(3) Level of significance, 

(4) Region of rejection. 

The reader should note that a test thus defined is only one of 
a variety of test types. It is known as a fixed-sample-size, two- 
decision test. One different type of test is the sequential type in 
which sample size is not specified in advance of the experiment.* 
In another type of test, known as a multiple-decision test, there 
are more decisions than the two of acceptance and rejection con¬ 
sidered here. 

For the fixed-sample-size, two-decision test with which we are 
dealing, a change in any one of the four elements constitutes a 
change in the test. Consider now a hypothesis and an alternative; 
for example: 

Hypothesis H: p = 0 

Alternative A: p = 1. 

Suppose a test has been selected to test hypothesis H. It is 
possible to determine the probability that this test will lead to the 
rejection of H in case H is false and A is true, that is in case p — 1. 
Now consider a second test obtained by changing the level of 
significance or changing sample size, or both. It is also possible 
to compute the probability that this second test will lead to the 
rejection of H : p = 0 if p = 1. If the second probability is greater 
than the first, we say that the second test is more powerful than 
the first for rejecting H when A is true. For each test separately 
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the probability of rejecting H when A is true is called the power 
of the test in that relationship between H and A. 

Table 14.3 shows the power of eight different tests for re¬ 
jecting p = 0 when /x = 1, and of the same eight tests for rejecting 
il = o when p = — 1. Consider first the alternative p = 1 and the 

TABLE 14.3 Power of Several Tests of the Hypothesis p = 0 against 

the Alternatives p = 1 and p = -1 


8 


= 10 


Alternative 

Level of 
significance 

Power of the test 

Two-tailed test 

Upper-tailed test 

N = 100 N = 400 

> 

n 

II 

o 

o 

> 

II 

M = 1 

M= - 1 

.05 

.01 

.05 

.01 

.17 -52 

.06 .28 

.17 .52 

.06 -28 

.26 .64 

.09 .37 

.00 -00 

.00 00 


four tests with sample size 100. Of the two-tai ed tests, the one 
with level of significance .05 is more powerful than the one with 
level of significance .01; the situation is similar for the two tests 
with critical region in the upper tail. These comparisons illustrate 
the principle that an increase in probability of a Type I ' 
leads to an increase in power. In other words, a decrease in the 
probability of a Type I error can be bought at the price 

increase in the probability of a Type II error. 

Another principle can be discovered by comparing the power 

of the two upper-tailed tests with the power o t IP 1 , 

two-tailed tests for testing the hypothesis m - ‘ 

alternative u = 1 In this situation the upper-tailed tests a e 

clearly the more powerful. The two-tailed^ ^t^a,'^equally 

powerful for the alternatives /i - 1 and M * _ . 

tailed tests have almost no power against t le a t l tailed 

In fact, if the hypothesis M = 0 is tested by use of tins uppu Uded 

test when „ is actually equal to -1, the false hypothesis M ~ 
almost sure to be accepted. This important point will be de 

veloped more fully below. in- 

If sample size is increased from 100 to 4 “’. ’ f lhe 

creased regardless of level of significance aiu ieg, _ j 

alternative value of p. However, for t ic a 11 « would 

the power is still so small that several more decimal places 

be required to distinguish the value from zero. 
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Fig. 14-1 Power Functions of Two-tailed Tests of Hypothesis H: /x = 0 for Two Levels 
of Significance and Two Sizes of Sample with s = 10. 


Power Function. The relationships in Table 14.3 are developed 
more fully in Figures 14-1 and^ 14-2. From these figures it is 
possible to read the probability of rejecting the hypothesis m - 0 
if some particular alternative is true. For each curve, possible 
alternative values of /x are given on the horizontal scale. For 
each such alternative (as, for example, the alternatives /z = 1 or 
fj. = - 1 shown in Table 14.3), the probability of rejecting the 
hypothesis fx = 0, has been computed and a point has been plotted 
with the alternative value of fi as abscissa and the corresponding 
probability of rejection (or power) as ordinate. Examining these 
graphs, the reader should note that on each of them, when /x = 0, 
the probability of rejection is exactly equal to the level of sig¬ 
nificance. He should also identify on the appropriate graph the 
point corresponding to each entry in Table 14.3. 

Certain general principles which are of importance in deciding 
whether to use a larger or a smaller significance level, a larger or 
a smaller sample, or a one-tailed or a two-tailed test, can now 
be made clear through further examination of these graphs. 
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POWER FUNCTION 

Look first at Figure 14-1 and verify the following probabilities 
of rejection when N = 100: 


If M = 0 

If fj. = .05 or - 0.5 
If/i=lor/x = — 1 
If/i = 2or/i=—2 
IfM = 3orM=-3 
If /x = 4 or n = - 4 
From the probabilities thus verified and from the corresponding 
probabilities for samples of 400, the following conclusion should 
now be clear. In general, when a two-tailed test is used, the 
probability of rejection increases as the true alternative deviates more 
widely from the value under the hypothesis. It pis near zero though 
not precisely zero, there is small probability of rejecting H. 
p = 0, but, when the actual value of p departs considerably from 


For test at 

For test at 

.05 level 

.01 level 
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FI„. 14-2 Power Pone,ion, of Upper-.oi.ed Te,., = ol Hypo.he.i, H : p = 0 for Two Level, 
of Significance and Two Sizes of Samples with s 
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zero, the test makes rejection of H: p = 0 almost a certainty 
This is a quite satisfactory state of affairs, for obviously in a 
practical situation it may be essential to reject a hypothesis about 
p when the hypothesized value is very wrong, and not essential 
to reject it when it is only slightly wrong. Now compare the proba¬ 
bilities of rejection shown above for the two levels of significance. 
For each alternative stated, the power is less when .01 is used as 
the level of significance than when .05 is used. In general, reducing 
the level of significance causes a reduction in the power of the test. 
In other words, an investigator who demands a very low proba¬ 
bility of rejecting a true hypothesis must pay for that caution by 
incurring a larger probability of not rejecting it when some other 
hypothesis is true. 

Now compare the probabilities of rejection listed above for 
N = 100 with corresponding probabilities read from the graphs for 
N = 400. For a fixed level of significance , power is increased as 
sample size is increased , and this relation holds for all alternatives 
to the hypothesis tested. In Figure 14-2 it is obvious that the 
probability of rejection is near one for alternatives much greater 
than zero, and for such alternatives the power of the one-tailed 
test shown in Figure 14-2 is greater than the power of the two- 
tailed tests shown in Figure 14-1. However, for alternatives less 
than zero, the upper-tailed test has scarcely any power. 

For the upper-tailed test, as for the two-tailed test, lowering 
the level of significance results in a reduction of power, and in¬ 
creasing sample size results in an increase of power for alternatives 
greater than p = 0. 

The practical consequences of the preceding discussion for the 
selection of tests of hypotheses will now be outlined. 

1. Sample size should be made as large as feasible without 
sacrificing the needs of good experimentation, to assure good power. 

2. If sample size is small because of the nature of the study, 
then the level of significance should not be very small. The justi¬ 
fication for this statement is that small sample size provides low 
power for a test. Since a low level of significance further con¬ 
tributes to reduction of power, the combination of small sample 
size and a low level of significance is not desirable. 

3. The use which is to be made of the information gained from 
an investigation plays a part in the selection of a test of hypothesis. 
Ordinarily, the hypothesis which is being tested is rather non¬ 
committal. It consists, for example, of a statement that two 
groups do not differ. If the purpose of the study is to arrive at a 
scientific truth, one would be reluctant to reject a noncommittal 
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hypothesis and thereby accept a more positive alternative, unless 
the evidence is strongly in favor of the alternative. In these 
circumstances, one is likely to adopt a low level of significance, 

say .01 rather than .05. 

However, if the purpose of the study is to choose some remedia 
action, the investigator would not wish to reject a method which 
may be helpful, even if the evidence in favor of the method is 
slight Under such circumstances, the investigator would use a 
higher level of significance - for example, .05 in preference to .01 
4 In the choice between a one-tailed test and a two-tailed 
test,'it is necessary to weigh the gains and losses involved from 
this choice On the one hand, the one-tailed test gives greate 
power against certain alternatives. On the other hand the two 
tailed test provides assurance that the hypothesis „ = 0 will be 
rejected for all alternatives differing considerably from zero, 
whenever the one-tailed test does not provide such assurance. 

The one-tailed test is chosen when certam alternatives 
either of no importance or no interest. For example, suppose a 
new medical treatment is being compared with one already in 
general use The hypothesis ‘' old and new treatments are equally 

EE sr. ss-t snsirrf 

Sjrrs: rrasrs sra 

test should be used^ n ; g intendcd for 

ErEEr rx iztzz 

puted and it may be disregarded by other,, m ^ K ^ 

solid line gives the sampling distribution of t = undcr 116 

hypothesis M - 0 when N m ™ 17 the'picture! the 

jection has been placed in P nr. the segment of 

shaded area represents the ~ 

line extending to the right from A 

includes values of - * which arc larger than 1.64.,, and 

is the region of significance; ^ llnin 

from A = 1.645 and marked *** mciuo 

1.645, and is the region of acceptance. 



" Distribution under the hypothesis /i = 0 
True distribution if /X“I 
mwxmhh Region of acceptance ^=1.64 
Region of reaction 

\//////////fy< Level of significance—.05 

x — H 

Fig. 14-3 Distribution of t = /y/\QQ unc *er the Hypothesis that /x = 0 and True 
Distribution if /x = 1. Region of Rejection Based on a One-tailed Test N = 100, s = 10. 


Suppose, however, that the true value of n is actually n = 1 
and not, as hypothesized, n = 0. Then the actual sampling 

X - i _ 

distribution is the distribution of- - -= X — 1, and is given 


by the dotted line in Figure 14-3. The regions of rejection and 
acceptance, being based on the hypothetical distribution, are the 

lines marked-and x x x as already described. The probability 

that a sample will show a value of Y in the region of rejection is 
represented by the area under the actual sampling distribution 
(the dotted line) to the right of the ordinate AB. But the point. 
A is 1.64 - 1.0 = .64 units above /x - 1, and so the probability 
sought is the area under a normal curve to the right of an ordinate 
at z = .64. (The sample size is large enough to make the use 
of the normal probability distribution appropriate.) This area 
is .26. Hence, the probability of rejecting the hj r pothesis /x = 0, 
when fx is actually 1, is .26. This is the power of the one-sided 
test against the alternative n = 1. In Figure 14-2, the test for 
At = 0 at significance level .05, when 1 V = 100 and s = 10, is shown 
in Section A. The ordinate at n = 1 can be read as .26. 

In similar fashion, the power against the alternative n = 1.5 
can be computed as .44, against the alternative fi = 2.0 as 
.63 and against fi = 3.5 as .97. 

The computation shown above was made particularly simple 
because s/ VX came out so conveniently as 10/10 = 1, but in Sec¬ 
tion B of Figure 14-2 s = 10 and N = 400. Then the region of 
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- Distribution under tbe hypothesis /i-0 

—-— True distribution if /i = l 

******** Region of acceptance A =*.82 
Region of rejection 
W////////A L«v*l of significance =.05 

Fi,. 14-4 Distribution of 1 = ^ under the Hypothesis thot p = 0 end True Dis¬ 
tribution if p = 1. Reoion of Rejection Bosed on o One-loiled Test N = 400, s = 10. 

significance consists of values of —= 75 S reater than 164 ’ 

which means values of X greater than .82. The hypothetical and 
actual distributions now appear like Figure 14-4, and the proba¬ 
bility sought is the area under the actuaUurve to the right the 

ordinate at A. But the point A is 1( y y 40 o 

standard deviation units from the alternative P = 1- The area 
under a normal curve to the right of J = - .36 is .64. Compa e 
£ result with the ordinate at p = 1 of the power curve in Sec- 

ti0 F!; that same one-tailed test of p = 0 (AT - 400, s = 10 and 

significance level of .05), you may wish to verify by “™P" tat “ 
that the power against the alternative p = .25 s 13, against h 
alternative p - .75 is .44; against the alternative p - 15 «a .01. 





Inferences about 
Proportions 


Methods of statistical inference for means have been developed 
in the two preceding chapters. Now we shall consider similar 
procedures applicable to proportions. 

Meaning of Proportion. The word “proportion 1 ” is used here 
to mean the fractional part of a group of discrete individuals. 
Thus, if a survey of 800 dwelling units should find 128 units sub¬ 
standard, the proportion of substandard units could be expressed 
as or as any other equivalent fraction such as ^ or or .16. 
The percent of substandard units, however, can be expressed in 
only one way, namely as 16 percent, since percent means literally 
per hundred (per centum). 

In ordinary speech the term “proportion” is also applied 
to the fractional part of a continuous quantity, as, for example, 
the proportion of milk sold in New York City which is supplied by 
a particular distributor, or the proportion of a teacher’s time which 
is spent on routine tasks. The methods described in this chapter 
are not applicable to proportions of this sort. 

One other correct use of the term “proportion” to which the 
methods of this chapter cannot be correctly applied should be 
mentioned. Consider a psychological test in which subjects make 
free responses which are then classified by judges into categories. 


Suppose these results have been obtained: 

Number of subjects examined 150 

Number of responses made in all categories 3000 

Number of responses in category A 420 

Number of subjects who gave no responses in 

category A 48 


1 The word "proportion" ia used here with a meaning somewhat different from that 
given to it in textbooks in mathematics where it is defined as the equality of two ratios. 
The meaning attributed to the word here has become standard usage in statistical 
writing. 
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It is possible to make inferences about the proportion of subjects 
who give no responses in category A and to use as datum the 
statistic = .32. It is not possible to use the methods of this 
chapter to make inferences about the proportion of responses 

falling into category A based on the statistic 3 Vo% - - 14 - ■ beea "“ 
such responses are not independent individuals subject to e 
processes of random sampling. Samples are not drawn from 
population of responses but from a population of subjects 

Conversion of Dichotomous Data to Measurement Data. 
Methods of statistical inference for proportions are very smular to 
those for means, which are already familiar. The reason for this 
similarity will be clarified if we note how a dichotomy can be trans¬ 
formed into a special kind of measurement by assigning numerical 

values to the two classes in the dichotomy. „ , , 

Consider for example, the data reported in Appendix Table 

VIII The six test scores reported are scaled data, and sex is a 

dichotomy^ We shall assign the numbers 0 and 1 to the two sex 

categorieJ It is immaterial which number is assigned to which 

caKy but* as some choice must be made, we shall score 
category, dul j indicates presence in a 

women as 0 ana men as 1 . . ^ 

class (males) and 0 means absence from that class. For the first 

10 students from Table VIII we have the following tabulation. 


Code number of student: 
Sex of student: 

Value of X 


1 

M 

1 


2 

M 

1 


3 

M 

1 


4 

M 

1 


5 

F 

0 


6 

F 

0 


7 

M 

1 


8 

M 

1 


9 10 

M F 
1 0 


By counting entries in the second line we see that thereare 

, th, .0 — y '.r:° IX1.“ .»d 

By adding entries m the third line 

* M ! = 7 Clearly the arbitrary assignment of the num- 

J. eld !°t. th. »d «' th “,f “ y ■ h0 ”‘ “ ‘ 

proportion •> ^ "‘‘proportion of »om«» .mopp 

■« - * ■ 1 *“ d 

male X - 0, then 2X 3 and • • f will be use d here 

Symbolism for Proportions. The symo p have a 

to represent the proportion of in ivi .. w |,j c ), ave in a 

specified characteristic (that is, e proportion which do not 
given class) and 1 - P to represent the P ^, The letter 

have that characteristic (are not in t^B ^ ^ individuals in a 
q is often used in place of 1 V * 
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sample, then Np is the number which have the given characteristic 
and N(1 - p) or Nq is the number which do not have it. 

For the population from which the sample is presumed to be 
drawn, we shall use capital P to represent the proportion in the 
given class and 1 - P or Q to represent the proportion not in the 
class. The Greek letter tt which is analogous to p will not be used 
for two reasons: (1) it is in familiar use to mean the number 
3.1416 and, (2) the Greek alphabet has no letter analogous to 
our Q. 

Statistical Inference about Proportions. The parameter P 
of a dichotomous population is analogous to the parameter p of 
a population defined on a continuous variable. Therefore the 
four main problems of inference about p described in Chapters 13 
and 14 suggest the following analogous problems about P: 

(1) To find a point estimate for P; 

(2) To find an interval estimate, or confidence interval, for P ; 

(3) To test the hypothesis that P has some specified value; 

(4) To test the hypothesis that two populations have the same 
value of P } regardless of what that value may be. 

Population Distribution. In making inferences about means, 
it was assumed that there was a population distribution of some 
variable X with population parameters p and <r. The sample 
statistic A r , which will vary from sample to sample, must provide 
the basis for inferences about p } as p itself is almost always un¬ 
known. In the problems of Chapters 13 and 14, the variable X 
was always a scaled variable and always treated as if con¬ 
tinuous. 

In the problems of our present discussion, the population dis¬ 
tribution is a dichotomy in which the proportion of individuals 
in one class is P and in the other, 1 — P. This population has 
a mean and a standard deviation also, and it can be shown by a 
little mathematics that these are 

(15.1) Mean = ^ = p 

(15.2) Variance = a 2 = P(1 - P) 

(15.3) Standard deviation = a = VP(l — P) 

One notable difference now appears between the populations 
considered previously and the two-class populations which we are 
now considering. In the former, p and a are quite unrelated, 
knowledge of one gives no information as to the value of the other. 
However, in these two-class populations, <r depends completely 
on P. 
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SAMPLING DISTRIBUTION OF A PROPORTION 

Fifnire 15-1 illustrates a dichotomous population in which 
p = .5 and consequently <r = VJUE) = .5. Figure 15-2 illustrates 
a dichotomous population in which P = .8 and consequently 

a = VM2) = .4. 



pj g# 15-1 Population Distribution 

when P = .5. 



Fig. 15-2 Population Distribution 
when P = .&. 


Sampling Distribution of a Proportion Suppose we draw a 
random sample of 2 persons out of a population mwluch tl e P™' 
portion of men is P = .5. The sample may have no men, l man, 

or 2 men, and the corresponding proportions will be P = 2 “ °’ 

p _ l = .5, and V = ? = l.°. Thus if N = 2, p can take one of 

three different values and the sampling distribution of p is a 
distribution with three classes. The probabilities attaching to 

-s s.:, k rzrt cr « 

one of N + 1 values ranging from P - 0 to p # 1,00 » 

and so the sampling distribution of p is a + * 

- ttsssas 

sTtiS »;■ 5 ».d -- ..- 

, a, the 
1&-30. 
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purposes unless N is small or P is near 0 or near 1. Figure 15-3 
shows the exact sampling distribution for p when P = .5 and 
N = 10, and also shows a normal curve placed over the exact 


Probability 

density 



Fig. 15-3 Exact Sampling Distribution for p when P = .5 and N = 10 and Normal 

Distribution with Mean = .5 and Standard Deviation = \/ — = .158. 

V 10 

distribution. Figure 15-4 is similar except that P = .8. The 
relationship between the normal distribution and the sampling 
distribution for proportions suggests that areas of the former 


Probability 

density 



may be used as approximations for areas of the latter, to simplify 

computation. Ways in which the approximation can be used 

effectively will be described below. Methods for exact calculation 
will also be developed. 

Some features of the normal approximation need to be pointed 
out. 

1. The normal approximation improves as sample size in¬ 
creases. 

2. For given N the goodness of approximation depends on P. 
The normal approximation is useful for the distribution of p from 
samples of 10 when P = .5, but is very poor for samples of the 








INTERVAL ESTIMATION OF A PROPORTION M ’ 

same size when P is near 0 or near 1. A rough rule of thumb is 
!w the normal approximation is satisfactory for given P, if 

i • * cuf'h NP = 5 or more when P is equal to or 

sample size N is such that ivr , , , . i p5 ,ct 50 

less than 0.5. Thus if P is 0.1, then N should be at least 50. 

For P more than 0.5, apply the same rule to Q - 1 r. 

3 Because the normal approximation involves fitting a curve 

sssrS 

for the mean, variance, and standard deviation of the distribution 
of p for samples of size N are needed. 

= P 

(15.4) Mean of p = 

2 = P(1 - P) 

(15.5) Variance of p “ N 


st 


(15.6) Standard deviation oi p - - y # 

i j • of n is usually spoken of as the standard 
The standard deviation ot p is usual y f 

error °f P- ^ .. nf proportion. When the unknown pro- 

provides the best single value ava>lable just as^sa, P ^ ? 

provides the best sing e es ‘™ a parameters. In other words, 
and X are unbiased ^ ^ P ^ q{ p> but the 

’.™ ..“ P X « »™"' »< ““ “ be ‘ ,m “ l 

exactly equal to the population vaue^ confidence interval 

Interval Estimation of a approximatio n to the 

for P will be obtained by us similar to the one already 

distribution of p. The P roce interval for p. The essential 
described in obtaining a con c e that it requires a 

difference in the procedure to be ^iDea 

correction for fitting a norma ™ rv con fidence coefficient c 

As before, we begin by, a d« , e of „ 0 rmal probabil- 

and reading the values zi±c and z^c 

ity. The confidence limits are then. _ 


(15.7) 


Upper limit = p + 2 N + Z 'T 
Lower limit = P “ 2 V + Zx ~Y 


p^T) 

‘ N 




-jd 

JV 


(15.8) 
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Here the quantity is a correction for fitting the normal curve 

to a histogram, usually called a correcti on for contin uity. The 
reader will also note that the quantity, Vp( 1 - p)/N , is an esti¬ 
mate of the standard error of p obtained by substituting p for P 

in formula (15.6). 

As an illustration, suppose a superintendent of schools wants 
an estimate of how many persons of voting age in his community 
would be in favor of a bond issue for a new school building, and 
suppose he selects a random sample of 200 such adults. (This 
selection is usually the hardest part of the whole undertaking and 
requires a great deal of careful planning.) Suppose an inquiry 
to these 200 voters reveals that 140 are in favor of the bond issue. 
Then the best single estimate of the proportion of voters who will 
vote affirmatively — assuming no major differences between 
what a voter says now and what he will do at a later date — is 

m = - 70 . . . 

If the superintendent wishes to make an interval estimate, he 
must choose a confidence coefficient. Suppose he chooses to let 


c = .95. Then, 

Upper limit = .70 + 


2 ( 200 ) 


+ 1.96 



Lower limit = .70 — 


1 

2 ( 200 ) 



With confidence coefficient .95 he can make the statement that 
.634 < P < .766. If he wants to make a statement in which he 
has still greater confidence, he can use a larger value of c. If he 
uses c = .99, his statement would be .614 < P < .786. Certainly 
he can feel quite confident that, unless there is a definite shift of 
opinion before election day, the vote will carry with a good 
majority. 

Test of a Hypothesis about a Proportion. To test a hypothesis 
about a proportion, we proceed in a manner similar to the one 
which was adopted for testing hypotheses about a mean. The 
procedure will be modified somewhat because of the necessary 
corrections for continuity. We choose a level of significance, 
for example, .05 or .01, and set up a region of rejection which may 
be in the upper tail, in the lower tail or in both tails. 

Suppose now that the hypothetical proportion is P, sample 
size is N, and the level of significance is .05. Then, the region of 
rejection for an upper-tailed test consists of values of p such that 
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(15.9) 


1 J p V~ P) 

V > P + 2 AT + z - 96 V N 


. z is the usual value referred to the tables of normal 
probability. Similarly, for a lower-tailed test, the region of sig- 

nificance would be 

(15.10) p <P- 

For the two-tailed region, 

p > P + 


(15.11) 

and 


< P- 


1 , 

/¥( 1 - P) 

2 N + V 

N 

we have both 

1 , 4 

/p( i - p) 

2N + Zmy 

/' N 

1 , 

/P( 1 - P) 

2N + Z '0“ 

V N 


on 
For 


- >» rrr s 

hypotheses on the basi aDD licable to problems such as 

individuals. The “ *h. P re- 

the comparison of deaf and^hearmg ^ ^ ^ ^ p 

C a1est g of ffhypothesl that the median of a population of dif- 

fer T C he procedure S wUi°be described in relation to ^scores 

balance of deaf and hearing gir *' thTsigns of the differences 

37 p.i„ .1 >"Wf“ “ rr“ F», 1, .hi. m- 

ST Co* e s S £ =• S. “ 

& “ - 94 ' .. „ tW in general, the deaf and hearing do 

On the assumption th , g lifferences would be assumed to 

not differ in balance, hal o ^ ^ proportion of positive dif- 
be plus and half minus. P u the hypothesis 

ferences in the population, then P 

t0 Usingfhe .05 level of significance and an upper-tailed region, 
the region of rejection is 


or 


_. . +1.645 

p > .5 + 2(35 ) T 

p > .65 




, 5)(1 - -5) 
35 



252 


INFERENCES ABOUT PROPORTIONS 


TABLE 15.1 Motor Abilities of 37 Pairs of Deaf and Hearing Girls* 


Pair 

number 

Tapping with 
right hand 

Grip 

Balance 

_____ 

Tapping with 
best hand 

D 

H 

D 

H 

D 

H 

D 

H 

1 

1.3 

25.7 

10 

16 

2.0 

2.3 

18.3 

25.7 

2 

17.0 

27.7 

14 

12 

2.0 

1.0 

17.0 

27.7 

3 

26.3 

26.0 

12 

17 

2.7 

3.7 

26.3 

26.0 

4 

29.7 

26.3 

21 

18 

2.7 

3.3 

29.7 

28.7 

5 

23.0 

26.3 

19 

15 

3.0 


23.0 

26.3 

6 

22.7 

24.0 

12 

21 

2.7 

2.7 

22.7 

24.0 

7 

22.0 

35.0 

19 

16 

3.7 

8.3 

22.3 

35.0 

8 

20.7 

28.7 

17 

22 

1.3 

6.0 

30.7 

28.7 

9 

29.0 

30.3 

19 

18 

2.0 


29.0 

30.3 

10 

30.0 

26.3 

25 

16 

4.3 

BfilB 

30.0 

26.3 

11 

26.7 

37.3 

23 

28 

1.0 

1.3 

26.7 

37.3 

12 

36.0 

30.7 

25 

22 

1.7 

4.0 

36.0 

30.7 

13 

30.7 

31.3 

24 

21 

5.0 

5.3 

30.7 

31.3 

14 

22.3 

25.0 

15 

21 

1.0 

2.0 

22.3 

25.0 

15 

38.3 

38.7 

25 

30 

1.7 

2.7 

38.3 

38.7 

16 

28.7 

24.0 

13 

25 

3.3 

4.7 

28.7 

24.0 

17 

34.3 

32.3 

25 

31 

3.0 


34.3 

32.3 

18 

35.7 

26.3 

23 

23 



35.7 

26.3 

19 

32.3 

30.7 

23 

35 

8.0 

10.0 

32.3 

33.0 

20 

33.0 i 

31.0 

15 

28 

2.3 

8.0 

33.0 

31.0 

21 

27.3 

36.7 

18 

33 

1.7 


29.3 

36.7 

22 

27.7 

31.7 

19 

23 

2.0 


27.7 

31.7 

23 

29.0 

28.7 

34 

20 

2.0 

7.0 

34.0 

34.0 

24 

35.3 

30.7 

32 

45 

2.7 


35.3 

30.7 

25 

30.3 

27.7 

29 

39 

1.0 


32.3 

27.7 

26 

39.0 

32.0 

26 

29 

2.0 

5.0 

39.7 

32.0 

27 

35.7 

28.0 

35 

32 

1.3 

4.0 

35.7 

28.0 

28 

31.7 

31.3 

31 

25 

2.3 


31.7 

31.3 

29 

37.7 

36.7 

29 

26 

4.3 


37.7 

36.7 

30 

39.3 

35.3 

31 

33 

5.3 

10.0 

39.3 

35.3 

31 

30.7 

31.3 

31 

38 

2.3 

3.7 

30.7 

32.0 

32 

26.7 

27.3 

22 

28 

2.0 

8.7 

26.7 

30.7 

33 

35.7 

33.0 

34 

19 

1.7 

3.7 

35.7 

33.0 

34 

34.0 

32.0 

22 

32 

3.0 

3.0 

34.0 

32.0 

35 

25.3 

30.3 

27 

29 

4.0 

7.0 

25.3 

30.3 

36 

29.7 

33.3 

31 

31 

4.7 

6.3 

29.7 

33.3 

37 

35.3 

29.3 

29 

28 

1.3 

7.3 

35.3 

29.3 


* Source of data: Long, op. cit. 


Hence, the hypothesis P = .5 is to be rejected if the observed 
proportion exceeds .65. Since the proportion in the sample is 
actually .94, the hypothesis is rejected. 

Applying a two-tailed test with a .05 level of significance, the 
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testing hypotheses about a propobtion 
region of rejection consists of all points p so that 

-■ 1 ■- 1 . 961 /® = - 68 


< . 5 - 


2 ( 35 ) 

1 


35 


and 


-1.96v/^lf^=- 32 


P < - 5 '2(35) - V 35 

riHS SSsS i 

• t Jnn thp basis of the two-tailed test as well. 

»y w °* h “ 

may be found in a^anced texts^ importance , is that non- 

Another advantage, t gh Uule work , results closely 

rihT^ed by tests based on the assumption 

pointed out, the methods for test ■ SYP approximate Exact 

tions described to tins poi tensive tables and mathe- 

methods are available bu r q t resu i ts is available 

™«c„l to** ““’i’SrS.I th,« p.*: A 

to Appendix T.M. XII. Th» ,„d C, to 

for the two-tailed test, B io are PP uitable for testing at the 
the lower-tailed test. A p entry in the table 

5 -percent level of significance > he number of 

indicates a region of rejectio y P ue of P at the head of the 

cases at the left of the row a The entries are stated m 

srASt’S *r. *< p =- 

s: x ft “ mr-u——- 

differences. f {W ta hl e , let us apply it to make 

As an example of the use The data a re N = 3o, 

the sign test of the previous section. 

k = 33, and P = 0.5. f to par t A, the column 

To make a two-tailed test _ The corresponding 

headed .50, and the row for which N - *>. 
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pair of entries is 11 and 24. This means that the hypothesis 
p = .50 is rejected if the number of positive differences is either 
as small as, or smaller than, 11 or is as large as, or larger than, 24. 
Since the observed value of k is 33, the hypothesis is rejected. 
If the numbers 11 and 24 are reduced to proportions, we have 
ii. = .314 which approximates the value .320 previously obtained 
and = .686 which approximates the value .680. 

To develop the corresponding test using a region of rejection 
in the upper tail, we refer to part B of Table XII. The entry 
in the column headed .50 and the row for which N = 35 is 23. 
Hence, the hypothesis is again rejected. 

Hypothesis that P Is the Same for Two Populations. Suppose 
that 105 of the 200 adults in the sample described earlier in this 
chapter have children in school and 95 do not; and suppose that 
90 of the 105 parents are in favor of the bond issue, while only 
50 of the 95 nonparents are in favor. The situation can be sum¬ 
marized thus: 



Parents 

Non parents 

Combined 

group 

Approve 

90 

50 

140 

Disapprove 

15 

45 

60 

Total 

105 

95 

200 


Two populations are under consideration here, a population of 
parents and one of nonparents. To distinguish these two popula¬ 
tions and the samples from them, we shall use the following 
symbols: 

Pi = proportion of the population of parents who approve 
the issue 

P 2 = proportion of the population of nonparents who approve 
the issue 

Pi = proportion of the sample of parents who approve the 
issue; p\ = 90/105 = .857 

p 2 = proportion of the sample of nonparents who approve 
the issue; p 2 = 50/95 = .526 

The question to be answered is whether the difference 
Pi - pi = .857 - .526 = .331 is so great that it cannot reasonably 
be presumed to be an outcome of random sampling from two 
populations which have the same value of P. In other words, 
do the sample data contradict the hypothesis P x = P 2 ? 
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hypothesis 

To answer this question, we need the standard error of the dif- 
ference pi — P 2 > which is 


( 15 . 12 ) 


^/ Pi(l - P ») + P 2 (1 ~ gg ) 


JVi ' 

If we call this estimate p, then for the given data 


90 + 50 _ 140 
p “ 105 + 95 200 


= .70 


, .. • ,i n i p _ p n the standard error is esti- 

Under the hypothesis that Pi - r 2 , t 

mated by the value 


( 15 . 13 ) 


s Pi-p» 


'pil - p) , PC ~ g) 

W, + N, 


If neither A', nor AT, is very small, tests of the hypothesis P, - P. 
can be based on the standard normal distribution. 

When the alternative is ft > ft, use the statistic 

N 

V^-Vi- 2 NiNi 

( 15 . 14 ) 


z = 


V® 


- V) , P( 1 -p> 
V, + A r 2 


and set the region of rejection in the upper tail. When the al¬ 
ternative is P* > Pi, use the statistic 

N 

P* " P» " 2 AW 

( 15 . 15 ) 


z = 


y/Sl 


- p) , P(* ~ P i 
— + A', 


and set the region of rejection in the upp jj e( j reg i on . For 

»•—* "“*>• •* u ' 

lower tail, use the statistic ^ 

vi - p* + 

( 15 . 16 ) 2 = 


v) i Ml_ El 

■wr + N > 


N 


The term 


_ is known as Fates' correction or the correction 
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for continuity , and is usually presented in a somewhat different 
form. Here, N = Ni + N** 

In the numerical problem introduced earlier in this section, we 
found pi = .857, p 2 = .526, p = .700, Ni = 105, N 2 = 95 , N = 200. 
Then, if the alternative is Pi > P 2 , we compute 



857 - .526 - 


200 


2(105)(95) _ ^321 


v/ 


(.7) (.3) (.7) (.3) 

105 ^ 95 


.0649 


= 4.9 


This computed z is larger than z .99 = 2.33 and the hypothesis is 
rejected. It must be concluded that the parents are more favorable 
to the bond issue than the nonparents. 


EXERCISE 15.1 

1 . Apply the sign test to the data of Table 14.1, using an upper-tailed 
test and the .05 level of significance. Use both the normal approxima¬ 
tion and Appendix Table XII. 

2. Apply the sign test to the difference in the scores of grip recorded in 
Table 15.1. Are the results consistent with the answer to Question 3 
in Exercise 14.1? 

3. A group of 50 laboratory animals was divided into two random groups 
of 25 each, and each group was trained in a complex task by a different 
method. At the end of a training period, 18 animals in group A and 
12 in group B had mastered the task. Test the hypothesis that the two 
methods are equally effective. (Consider that the two methods of 
training generate two populations.) 



The general procedures of statistical inference were developed 
in Chapters 13 and 14 and applied to problems about means. 

„ Chapter 15 similar methods were applied to proportions. 
This chapter will introduce problems of statistical inference re- 

C ° e “te f Population. The correlation coefficient, r, which we 

4SW* v 0 ’ % - r:f—'Sr 

rrZSff for statistical inference about 

variate.- .. A\ft&r from eflch other with 

Just as univariate P°P" ^er^from^ ^ bivBriate 

respect to mean, spread, ’ , .. differences are much 

populations differ from each other, an chara cteris- 

more complex, because we -^^f^rlha actenstics of the 
tics of each variable separately, but also 

joint variation of the two variab es. bivariate population, 

We shall discuss briefly ^"Iht £opuE each variable, 
namely, the normal b ‘ va ™ le - ch Ls defined separately by its 

taken separately, is normal, a Hit ion the joint variation 

mean and standard deviation n adjtton, ^ u the 

of the two variables is described by a parame 
correlation in the population. 

237 
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Because a bivariate population involves two variables, the 
means and standard deviations of the population on those variables 
must be distinguished by appropriate symbols. Thus, if the two 
variables are called X and Y, the sample statistics would be 
X, Y, s X f s u , and r, and the population parameters p x , n Vi <r x , cr y , 
and p. If the two variables are called Xi and X 2 , the sample 
statistics would be X\, X 2 , s,, s 2 , and r, and the populations param¬ 
eters pi, p 2 , <r h a 2 , and p. The Greek letter p (rho) corresponds 
to the English r and so is an appropriate symbol for the correla¬ 
tion coefficient in the population. The reader should be warned 
that this symbol is also in popular use to mean a coefficient of 
rank-order correlation. 

Distribution of 7. The distribution of r, like that of the pro¬ 
portion, is not normal but approaches normality as sample size 
increases. However; the approach to normality depends not only 
on sample size, but also on the value of p. If samples are drawn 
from a population for which p is at or near zero, the distribution 
of r is approximately normal for samples even as small as 30. 
However, when p is near -f 1 or - 1, the distribution of r is quite 
skewed even for very large samples. We shall describe below a 
way of meeting this problem. 

Standard Error of r. The standard error of r is given "by the 
formula 


(16.1) 


1-P 2 

VN - 1 


We may note that the standard error of r, like that of a proportion, 
depends on its population parameter. This fact and the non¬ 
normality of the distribution of r limit the usefulness of the 
standard error of r. 

Tests of the Hypothesis that p = 0. If p = 0, the standard 

error of r given in formula (16.1) reduces to * and the 

VN - 1 

standard score for r, which is -—-> becomes rVN - 1. Then, 

& r 

if N is no t small, for example, not less than 30, the distribution of 
rVN - 1 will approximate the standard normal. 

Suppose we wish to test the hypothesis p = 0 with a two- 
sided test and significance level .05. A table of the normal dis¬ 
tribution indicates that the region of significance is z > 1.96 and 
z < — 1.96. Now suppose a sample of 35 cases is drawn, and in 
this sample r is found to be .30. Therefore, z = rVN - 1 = .30V34 

0 
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= 1.75. As this value is not in the region of significance, the 
hypothesis p = 0 is tenable, and the observed value of r is non¬ 
significant. 

The test just described is not exact because, even when p - 0, 
the distribution of r is not exactly normal. An exact test is avail¬ 
able because 


( 16 . 2 ) 


rv/F^~2 
Vl - r 2 


has the t distribution with N -2 degrees of freedom. Thus for 

.30\/33 * /33 

N = 35 and r = .30, t = 


= .30 


= 1.81. When we 


Vl - (.30) 2 T 91 
refer this value to Appendix Table V with 30 degrees of freedom 
(since Table V does not have entries for 33 degrees of freedom), 
we find it well below t. m = 2.05. Again, we cannot reject the 

hypothes^p - y ° pothesis p _ o by Use of Appendix Table VI. 

This appendix table enables one to test the null hypothesis 
about p directly from the known value of r without farther 
computation. It is entered with degrees o reec om n , - 

(For a partial correlation, not discussed here, n would be some¬ 
thing else.) By reference to this table, verify the follow mg. 

(1) N = 22. Then n = 20, t. k <, = - .423, r .„s = .423 

In a sample of 22 cases, any value of r nt j‘ bet ^ C , 
r = - 423 and r = .423 would be significant at the . 0 o level 

and would justify rejecting the hypothesis P - 0. 

(2) N = 52. Then n = 52, r.o->& - - .273, r .975 — . 

Ina sample of 52 cases, the value of r which » 
greater than .273 would be significant at the . 0 o le e . 

f3) N = 92. Any value of r which is numerically greater 

than .205 would be significant at the . 0 okn-el; 
numerically greater than .24-, bigm lean ‘ , ’ 

any value numerically greater than .33S, .significant 

.001 level. , 

The z, Transformation. When p U different 
sampling distribution of r is skewe , an " e ^ i d A 

scribed in the previous section - Tabk V!^a ^ ? ^ 
variable, which is usually called z . • . j iave a l- 

order to distinguish it from the ot ler van a ) * formula 

ready been denoted by that letter, is related to r b> 


( 16 . 3 ) 


1 , 1+r 

Zr = 2 Y^r 
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(16.4) = 1.1503 log,. 

(16.5) = 1.1503 Qogio (1 + r) - logio (1 - r)] 

This variable, introduced by R. A. Fisher, has two very great 
advantages. Its distribution is approximately normal even for 
small samples in which p is near 1, and its standard error does not 
depend on the unknown population value but only on the size 
of the sample, 


(16.6) 


_ 1 
* r_ VW ^1 


If f (zeta) is the population value of z r corresponding to p, then 
(z r - f) VN - 3 may be treated as a normally distributed variable. 

The transformation from r to z r and vice versa can be made 
easily by reference to Appendix Table VII. For example, suppose 
r = .85. Look in the field of the table for the number nearest to 
.85. It is .8511, standing in the row headed 1.2 and in the column 
headed .06. The value of z T corresponding to r = .85 is therefore 
read as 1.26. Or if z T = 2.09, the corresponding r will stand in a 
cell in the row 2.0 and column .09. The value found in that cell 
is r = .9699. You will note that small values of r and z r are very 
similar, but for large values, z r is considerably larger than r. 

Test of Hypothesis about p Using z T Transformation. Suppose 
we have r = .76 for N = 228, and we wish to test the hypothesis 
that p = .80 at significance level .05, against the alternative 
p < .80. A correct procedure would be as follows: 

(1) From Table VII, we find that if r = .76, z = 1.00; 

(2) From the same table we find that if p = .80, = 1.10; 

(3) = V225 = 15; 

(4) Then the standard score is 


1.00 - 1.10 

z = - ■ - 7 - - = - 1.50 

(1/V225) 

(5) As this is a one-sided test, the critical region is z < z. 05, 
which is z < - 1.64; 

(6) Since the computed value z = — 1.50 is not in the critical 
region, the hypothesis p = .80 is not rejected. 

We may contrast with this the decision which might have 
been incorrectly obtained by computing a r = .024 and 

(r - p) 


= - 1.67 
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The computations are not incorrect, but as we have no 
probability distribution to which the statistic -1.67 can be re¬ 
ferred, they lead nowhere. If now we incorrectly refer it to the 
normal probability distribution and reject the hypothesis p .80 
because - 1.67 < *.«, we shall make an incorrect decision 

Confidence Interval for p. The a transformation is useful in 
obtaining a confidence interval for p. The procedure is as follows. 

(1) In Table VII, find the value of z corresponding to the 

sample value of r; , 

(2) Compute a confidence interval for the population value f 

by the formula 

(16.7) Upper limit = z r + zi+< j VN - 3 
Lower limit = z, + z i-c j y/ N — 3 

Here one must remember that z T and Zi±c (or Zi-c) are two quite 
different values represented by the same letter only because of 

(3) of the bounding values of f. From these form the in- 
terval for p. r = 65 in a sample of 40 cases, 

Thus suppose we h ^e obtained ^ coefficient 

and we want an interval estim normal proba- 

.95. From Table VII we read z r = .78. From the non p 

bility table we find _>—- 

z nl = - 1.96 and - 1-96. V7TH = V37. 

1-96 , - 7 « + i£2 or .46 < f < 1-10 

Then .78 - -^= < f < 78 + V37 

j .43 < P < -80 

Have the Same Correlation. nomilations the difference 

two independent, normal nvaria f - s transformed 

r, - Ti should fluctuate around* around fl - as 

to z, the difference z, - z* will * 
mean and will have as standard deviation 


1 


1 
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(16.8) 


(zi - Z2) - ~ fo) 


1 


Ni - 3 1 N 2 - 3 . 


may be treated as a normal deviate. If the hypothesis that 
fis rejected, the hypothesis pi = P 2 must also be rejected. 
If the hypothesis ft = ft is sustained, the hypothesis Pi = P 2 
must also be sustained. 

In his research concerning Children’s Collecting Activity 1 Durost 
studied the collections made by 50 boys and 50 girls between the 
ages of 10 and 14. Among the correlations obtained was a correla¬ 
tion between mental age and the average rating of the child’s 
collections (each of the collections of each child being rated as to 
quality and the average taken for the child). For boys this 
correlation was .31 and for girls, .06. Do these figures provide 
evidence that the relation between mental maturity and quality 
of collections made is higher for boys than for girls? 



r 

Zr 

A - 3 

1 

N - 3 

Boys 

.31 

.32 

47 

.021277 = o\ B 

Girls 

.06 

.06 

47 

.021277 = 



zb — 2a = *26 


.042554 = ol B + & 



.26 

.26 

1.26 



V.0426 

.206 


With so small a value of z, it would be inappropriate to assume 
that the relationship is higher for boys than for girls. 

Test of the Hypothesis that p vt = p xz when Computed for the 
Same Population. This situation is quite different from that 
of the preceding section, though often confused with it. There 
we had measures on the same two variates, X and Y, for two 
different populations. Here we have for one population measures 
on three variates, A r , Y , and Z, and we wish to know whether 
Z is more highly correlated with A r than with Y, or vice versa. 
This question is particularly important when two predictors are 
available, but the one showing the higher correlation with the 
criterion is more expensive, more time consuming, or more 
difficult to apply. For example, suppose that a prognostic test, 
long and rather difficult to administer, has yielded a correlation 

' Walter X. Durost, Children’s Collecting Activity Related to Social Factors, New 
York. Teachers College, Columbia University, Bureau of Publications, 1932. 
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of .56 with freshman college marks while ai short*r 
to administer, has yielded a correlation of .43, he two^corre^ 

use only the shorter test. 

Let X = score on the longer test 
v = score on the shorter test 
z = average mark at end of first semester 

A solution of the problem of the sign^ce 

between r y , and r z , can be o tame ^ ^ ^ the popu i a tion, but 

the form of distribution of . on j y to a subpopula- 

sr?^r:^fsrr^rx- * - 

each value of Y, with common variance 


(N - 31(1 + r„) 

h- **».•. mm. -*»- ? srlStr. 


(16.9) Then t = ( r xt - r vi ) 


IV? n^yi) 


Student’s distribution w> n » - ”7 correlatlon between the 

Obviously it is necessary answered. Assume that 

two tests before the question can be ansueie 

Then' 52 * , = (.56 - .43) VM = 1.61 

TH n i„p of t it would be difficult to make a 

In view of this small value of t, . f on r Qne test is to be 

rr ~ v*. »..£ '* 

' ■ r. 

exist for interpreting the size of a u . n iic<l 'large” 

much confusion has arisen because what ^ iJruW „„ a 

in one connection would 3 >frwii*iits in one particular range 

numerical scale and to say t m col ilii.s confusion, 

are large and in another negligible only a 
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A value of r may be “large” because it is too large to be consistent 
with the null hypothesis p = 0; or because it is large enough to 
be useful for predictive purposes; or because it is larger than 
values commonly obtained in similar circumstances. To use a 
single word “large” to cover these three quite disparate meanings 
is sheer verbal laziness. We shall now look at each in turn. 

1. Interpretation in terms of statistical significance. To say 
that a sample value of r is significant at, for instance, the .01 
level means that, if a large number of samples of the given size 
are drawn from an uncorrelated population (i.e., a population in 
which p = 0), 1 percent of such samples would be expected to 
produce a value of r at least as large numerically as the observed 
value. This is the concept which has been under discussion in 
the preceding pages. A very small r may be significant if the 
sample is very large. Thus, for 500 cases, r = .09 is significant, 
but still too small to be useful for most purposes. It must always 
be kept in mind that the terms “significant” and “nonsignificant” 
are really comments on the relation of a sample to the hypothesis 
which is being tested. 

2. Interpretation in terms of usefulness of the regression equation 

for predictive purposes. Chapter 9 explained that the usefulness 
of a regression equation depends upon how closely the estimated 
and actual values agree. Chapter 10 explained that the correla¬ 
tion coefficient provides the clue to this agreement. Formula 
(10.1) can be rewritten as ' 


(16.10) 


1 -r 2 = 


2(F- ?y 

s {Y -Yy 


Here we see that 1 - r 2 expresses the proportion which the sum 
of squares of deviations from the regression line is to the sum of 
squares of deviations from the mean. The smaller this proportion, 
the better the prediction. 

3. Interpretation in terms of correlations commonly obtained. 
Sometimes, when a research worker comments that he obtained a 
high correlation between two variables, he means only that r was 
higher than previous experience had led him to anticipate. So 
many factors affect the size of r that it is well nigh impossible to 
draw up a list of typical values which would not be misleading. 
To build up a sense of what is reasonable to expect, a researcher 
requires experience, familiarity with the research literature in his 
own field, and careful attention to factors which make one situa¬ 
tion different from others with which it is being compared. 



.87 

.38 

.39 


.41 

.42 

.43 

.44 


.1358 

.1405 

.1443 

.1480 

.1617 

.1664 

.1591 

.1628 

.1664 

.1700 

.1736 

.1772 

.1808 

.1844 

.1879 


.3637 

.3821 

.3605 

.3589 

.3572 

.3555 

.3538 
































TABLE I Ordinates and Areas of the Standard 

Normal Curve. Concluded 


(In terms of a units) 


m 

Area 

Oroinate 

X 

Area 

Ordinate 

X 

Area 

Ordinate 

€ 


c 



9 



1.60 

.4332 

.1295 

2.00 

.4772 

.0540 

2.50 

.4938 

.0175 

1.61 

.4345 

.1276 

2.01 

.4778 

.0529 

2.51 

.4940 

.0171 

1.52 

.4357 

.1257 

2.02 

.4783 

.0519 

2.52 

.4041 

.0167 

1.63 

.4370 

.1238 

2.03 

.4788 

.0508 

2.53 

.4943 

.0168 

1.64 

.4382 

.1219 

2.04 

.4793 

.0498 

2.64 

.4945 

.0158 

1.66 

.4394 

.1200 

2.05 

.4798 

.0488 

2.55 

.4946 

.0154 

1.56 

.4406 

.1182 

2.06 

.4803 

.0478 

2.56 

.4048 

.0151 

1.57 

.4418 

.1163 

2.07 

.4808 

.0468 

2.57 

.4949 

.0147 

1.68 

.4429 

.1145 

2.08 

.4812 

.0459 

2.58 

.4951 

.0143 

1.60 

.4441 

.1127 

2.09 

.4817 

.0449 

2.69 

.4952 

.0189 

1.60 

.4452 

.1109 

2.10 

.4821 

.0440 

2.60 

.4953 

.0136 

1.61 

.4463 

.1092 

2.11 

.4828 

.0431 

2.61 

.4955 

.0182 

1.62 

.4474 

.1074 

2.12 

.4830 

.0422 

2.62 

.4956 

.0129 

1.63 

.4484 

.1057 

2.13 

.4834 

.0413 

2.63 

.4957 

.0126 

1.64 

.4495 

.1040 

2.14 

.4838 

.0404 

2.64 

.4959 

.0122 

1.66 

.4505 

.1023 

2.15 

.4842 

.0395 

2.65 

.4960 

.0119 

1.66 

.4515 

.1006 

2.16 

.4846 

.0387 

2.66 

.4981 

.0116 

1.67 

.4525 

.0989 

2.17 

.4850 

.0379 

2.67 

.4962 

.0113 

1.68 

.4535 

.0973 

2.18 

.4854 

.0371 

2.68 

.4963 

.0110 

1.60 

.4545 

.0957 

2.19 

.4857 

.0363 

2.69 

.4964 

.0107 

1.70 

.4554 

.0940 

2.20 

.4861 

.0355 ' 

2.70 

.4965 

.0104 

1.71 

.4564 

.0925 

2.21 

.4864 

.0347 

2.71 

.4966 

.0101 

1.72 

.4573 

.0909 

2.23 

.4868 

.0339 

2.72 

.4967 

.0099 

1.73 

.4582 

.0893 

2.23 

.4871 

.0332 

2.73 

.4968 

.0096 

1.74 

.4591 

.0878 

2.24 

.4875 

.0325 

2.74 

.4969 

.0093 

1.75 

.4599 

.0863 

2.25 

.4878 

.0317 

2.75 

.4970 

.0091 

1.76 

.4608 

.0848 

2.26 

.4881 

.0310 

2.76 

.4971 

.0088 

1.77 

.4616 

.0833 

2.27 

.4884 

.0303 

2.77 

.4972 

.0086 

1.78 

.4625 

.0818 

2.28 

.4887 

.0297 

2.78 

.4973 

.0084 

1.70 

.4633 

.0804 

2.29 

.4890 

.0290 

2.79 

.4974 

.0081 

1.80 

.4641 1 

.0790 

2.30 

.4893 

.0283 

2.80 

.4974 

.0079 

1.81 

.4649 

.0775 

2.31 

.4896 

.0277 

2.81 

.4975 

.0077 

1.82 

.4656 

.0761 

2.32 

.4898 

.0270 

2.82 

.4976 

.0075 

1.83 

.4664 

.0748 

2.33 

.4901 

.0264 

2.83 

.4977 

.0073 

1.84 

.4671 

.0734 

2.34 

.4904 

.0258 

2.84 

.4977 

.0071 

1.85 

.4678 

.0721 

2.35 

.4906 

.0252 

2.85 

.4978 

.0069 

1.86 

.4686 

.0707 

2.36 

.4909 

.0246 

2.86 

.4979 

.0067 

1.87 

.4693 

.0694 

2.37 

.4911 

.0241 

2.87 

.4979 

.0065 

1.88 

.4699 

.0681 

2.38 

.4913 

.0235 

2.88 

.4980 

.0063 

1.80 

.4706 

.0669 

2.39 

.4916 

.0229 

2.89 

.4981 

.0061 

1.00 

.4713 

.0656 

2.40 

.4918 

.0224 

2.90 

.4981 

.0060 

1.91 

.4719 

.0644 

2.41 

.4920 

.0219 

2.91 

.4982 

.0058 

1.02 

.4726 

.0632 

2.42 

.4922 

.0213 

2.92 

.4982 

.0056 

1.93 

.4732 

.0620 

2.43 

.4925 

.0208 

2.93 

.4983 

.0055 

1.64 

.4738 

.0608 

2.44 

.4027 

.0203 

2.94 

.4984 

.0053 

1.05 

.4744 

.0596 

2.45 

.4929 

.0198 

2.95 

.4984 

.0051 

1.06 

.4750 

.0584 

2.46 

.4931 

.0194 

2.96 

.4985 

.0050 

1.07 

.4756 

.0573 

2.47 

.4033 

.0189 

2.97 

.4985 

.0048 

1.98 

.4701 

.0562 

2.48 

.4934 

.0184 

2.98 

.4986 

.0047 

1.99 

.4707 

.0551 

2.49 

.4036 

.0180 

2.99 

.4986 

.0048 

2.00 

.4772 

.0540 

2.50 

.4938 

.0175 

3.00 

.4987 

.0044 
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TABLE II Percentile Values of the Standard 

Normal Distribution * 



-”7 nm The KeUev Statistical Tables, Harvard University 

• Entriesin this table are taken from he KUy ^ Kelley. 

Press, 1938, revised 1948, by lefl of , Thus ... = -524 . 

x is the percentile value named bj the area 

the 30th percentile. 
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TABLE III Squares, Square Roots, and Reciprocals 


n 

71 1 

Vn 

VlOn 

1 /» 

n 

n * 


VlOn 

1 /n 

1 

l 


3.162 

1.00000 

51 

2601 

7.141 

22.583 

.01961 

2 

4 

1.414 

4.472 

.50000 

52 

2704 

7.211 

22.804 

.01923 

3 

9 

1.732 

5.477 

.33333 

53 

2809 

7.280 

23.022 

KtfTisrj 

4 

16 


6.325 

.25000 

54 

2916 

7.348 

23.238 


5 

25 

2.236 

7.071 

.20000 

55 

3025 

7.416 

23.452 


6 

36 

2.449 

7.746 

.16667 

56 

3136 

7.483 

23.664 

.01786 

7 

49 

2.646 

8.367 

.14286 

57 

3249 

7.550 

23.875 

.01754 

8 

64 

2.828 

8.944 

.12500 

58 

3364 

7.616 

24.083 

.01724 

9 

81 

3.000 

9.487 

.11111 

59 

3481 

7.681 

24.290 

.01695 

10 

100 

3.162 

10.000 

.10000 

60 

3600 

7.746 

24.495 

.01667 

11 

121 

3.317 

10.488 

.09091 

61 

3721 

7.810 

24.698 

.01639 

12 

144 

3.464 

10.954 

.08333 

62 

3844 

7.874 

24.900 

.01613 

13 

169 

3.606 

11.402 

.07692 

63 

3969 

7.937 

25.100 

.01587 

14 

196 

3.742 

11.832 

.07143 

64 

4096 

8.000 

25.298 

.01562 

15 

225 

3.873 

12.247 

.06667 

65 

4225 

8.062 

25.495 

.01538 

16 

256 

4.000 

12.649 

.06250 

66 

4356 

8.124 

25.690 

.01515 

17 

289 

4.123 

13.038 

.05882 

67 

4489 

8.185 

25.884 

.01493 

18 

324 

4.243 

13.416 

.05556 

68 

4624 

8.246 

26.077 

.01471 

19 

361 

4.359 

13.784 

.05263 

69 

4761 

8.307 

26.268 

.01449 

20 

400 

4.472 

14.142 

.05000 

70 

4900 

8.367 

26.458 

.01429 

21 

441 

4.583 

14.491 

.04762 

71 

5041 

8.426 

26.646 

.01408 

22 

484 

gglEwrij 

14.832 

.04545 

72 

5184 

8.485 

26.833 

.01389 

23 

529 

4.796 

15.166 

.04348 

73 

5329 

8.544 

27.019 

.01370 

24 

576 

4.899 

15.492 

.04167 

74 

5476 

8.602 

27.203 

.01351 

25 

625 

5.000 

15.811 


75 

5625 

8.660 

27.386 

.01333 

26 

676 

5.099 

16.125 


76 

5776 

8.718 

27.568 

.01316 

27 

729 

5.196 

16.432 

.03704 

77 

5929 

8.775 

27.749 

.01299 

28 

784 

5.292 

16.733 

.03571 

78 

6084 

8.832 

27.928 

.01282 

29 

841 

5.385 

17.029 

.03448 

79 

6241 

8.888 

28.107 

.01266 

30 

900 

5.477 

17.321 

.03333 

80 

6400 

8.944 

28.284 

.01250 

31 

961 

5.568 

17.607 

.03226 

81 

6561 

9.000 

28.460 

.01235 

32 

1024 

5.657 

17.889 

.03125 

82 

6724 

9.055 

28.636 

.01220 

33 

1089 

5.745 

18.166 

.03030 

83 

6889 

9.110 

28.810 

.01205 

34 

1156 

5.831 

18.439 

.02941 

84 

7056 

9.165 

28.983 

.01190 

35 

1225 

5.916 

18.708 

.02857 

85 

7225 

9.220 

29.155 

.01176 

36 

1296 

6.000 

18.974 

.02778 

86 

7396 

9.274 

29.326 

.01163 

37 

1369 

6.083 

19.235 

.02703 

87 

7569 

9.327 

29.496 

.01149 

38 

1444 

6.164 

19.494 

.02632 

88 

7744 

9.381 

29.665 

.01136 

39 

1521 

6.245 

19.748 

.02564 

89 

7921 

9.434 

29.833 

.01124 

40 

1600 

6.325 

20.000 

.02500 

90 

8100 

9.487 

30.000 

.01111 

41 

1681 

6.403 

20.248 

.02439 

91 

8281 

9.539 

30.166 

.01099 

42 

1764 

6.481 

20.494 

.02381 

92 

8464 

9.592 

30.332 

.01087 

43 

1849 

6.557 

20.736 

.02326 

93 

8649 

9.644 

30.496 

.01075 

44 

1936 

6.633 

20.976 

.02273 

94 

8836 

9.695 

30.659 

.01064 

45 

2025 

6.708 

21.213 

.02222 

95 

9025 

9.747 

30.822 

.01053 

46 

2116 

6.782 

21.448 

.02174 

96 

9216 

9.798 

30.984 

.01042 

47 

2209 

6.856 

21.679 

.02128 

97 

9409 

9.849 

31.145 

.01031 

48 

2304 

6.928 

21.909 

.02083 

98 

9604 

9.899 

31.305 

.01020 

49 

2401 

7.000 

22.136 

.02041 

99 

9801 

9.950 

31.464 

.01010 

50 

2500 

7.071 

22 . 3 G 1 

.02000 

100 

10000 

10.000 

31.623 

.01000 













TABLE IV Four-place Squares of Numbers 


5 




1.8 

1.9 

2.0 

2.1 

2.2 

2.3 

2.4 

2.6 

2.6 

2.7 


1.000 I 1.020 


1.210 1.232 
1.440 1.464 

1.716 
1.988 


2 . 


2.560 2 . 
2.890 

3.240 
3.610 


4.000 


4.410 
4.840 

5.290 5.336 
5.760 5.808 


.250 6 


6.760 1 6 
7.290 


r 




04 


1.264 

1.488 

1.742 

2.016 


;i 


1.277 

1.513 

1.769 

2.045 


2 


65 


.082 1.103 1.124 

1.300 1.323 1.346 
1.538 1.563 1.588 


5 


1.369 

1.613 


1.392 
1.638 


1.796 1.823 1.850 
2.074 2.103 2.132 

2 2.403 2.43 

.690 2.723 2.756 
.028 3.063 3.098 


1.877 

2.161 


2.465 


2.789 

3.133 


1.904 

2.190 


2.49 


2.822 

3.168 


3.423 3.460 
3.803 3.842 


3.497 

3.881 


3.534 

3.920 


.85 




2 


537 

973 

429 

905 


62 


4.580 4.623 
5.018 5.063 

5.476 5.523 
5.954 6.003 


7.023 

7.563 



2.8 7.840 

2.9 8.410 
8.0 9.000 | 9 

3.1 9.610 

3.2 10.24 



6 


7.076 

7.618 


8.123 8.180 
8.703 8.762 


242 I 9.303 | 9.3 
9.923 | 9 
10.56 


7.129 

7.673 

8.237 

8.821 


2 


0.05 

0.69 


7.182 

7.728 

8.294 

8.880 


88 


1.416 

1.664 

1.932 

2.220 


2.856 

3.204 

3.572 

3.960 


8 


7.236 

7.784 

8.352 

8.940 


2 


13.62 

14.36 
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TABLE IV Four-place Squares of Numbers. Continued 


N 

SJUffi 

i 

2 

3 


5 

6 

7 

8 


6.6 

30.25 



mmm 


mm 

30.91 

31.02 

31.14 

31.25 

5.6 

5.7 

31.36 

32.49 

31.47 

32.60 

31.58 

32.72 

31.70 

32.83 

31.81 

32.95 

31.92 

33.06 

32.04 

33.18 

32.15 

33.29 

32.26 

33.41 

• 

32.38 

33.52 

5.8 

5.9 

33.64 

34.81 

33.76 

34.93 

33.87 

35.05 

33.99 

35.16 

34.11 

35.28 

34.22 

35.40 

34.34 

35.52 

34.46 

35.64 

34.57 

35.76 

34.69 

35.88 

6.0 1 36.00 

36.12 

36.24 

36.36 

36.48 

36.60 

36.72 

36.84 

36.97 

37.09 

6.1 

6.2 

37.21 

38.44 

37.33 

38.56 

37.45 

38.69 

37.58 

38.81 

37.70 

38.94 

37.82 

39.06 

37.95 

39.19 

38.07 

39.31 

38.19 

39.44 

38.32 

39.56 

6.3 

6.4 

39.69 

40.96 

39.82 

41.09 

39.94 

41.22 



40.32 

41.60 

40.45 

41.73 

40.58 

41.86 

40.70 

41.99 

40.83 

42.12 

6.6 

42.25 

42.38 

42.51 

42.64 

42.77 

42.90 

43.03 

43.16 

43.30 

43.43 

6.6 

6.7 

43.56 

44.89 


43.82 

45.16 

43.96 

45.29 


44.22 

45.56 

44.36 

45.70 

44.49 

45.83 

44.62 

45.97 

44.76 

46.10 

6.8 

6.9 

46.24 

47.61 

46.38 

47.75 

46.51 

47.89 


46.79 

48.16 

46.92 

48.30 

47.06 

48.44 

47.20 

48.58 

47.33 

48.72 

47.47 

48.86 

7.0 

49.00 

49.14 

49.28 

49.42 

49.56 

49.70 

49.84 

49.98 

50.13 

50.27 

7.1 

7.2 

50.41 

51.84 

50.55 

51.98 

50.69 

52.13 

50.84 

52.27 

50.98 

52.42 

51.12 

52.56 

51.27 

52.71 

51.41 

52.85 

51.55 

53.00 

51.70 

53.14 

7.3 

7.4 

53.29 

54.76 

53.44 

54.91 

53.58 

55.06 

53.73 

55.20 

53.88 

55.35 

54.02 

55.50 

54.17 

55.65 

54.32 

55.80 

54.46 

55.95 

54.61 

56.10 

7.6 

56.25 

56.40 

56.55 

56.70 

56.85 

57.00 

57.15 

57.30 

57.46 

57.61 

7.6 

7.7 

57.76 

59.29 

57.91 

59.44 

58.06 

59.60 

58.22 

59.75 

58.37 

59.91 

58.52 

60.06 

58.08 

60.22 

58.83 

60.37 

58.98 

60.53 

59.14 

60.68 

7.8 

7.9 

60.84 

62.41 

61.00 

62.57 

61.15 

62.73 

61.31 

62.88 

61.47 

63.04 

61.62 

63.20 

61.78 

63.36 

61.94 

63.52 

62.09 

63.68 

62.25 

63.84 

8.0 

6 - 1.00 

61.16 

64.32 

64.48 

64.64 

64.80 

64.96 

65.12 

65.29 

65.45 

8.1 

8.2 

65.61 

67.24 

65.77 

67.40 

65.93 

67.57 

66.10 

67.73 

66.26 

67.90 

66.42 

68.06 

66.59 

68.23 

66.75 

68.39 

66.91 

68.56 

67.08 

68.72 

8.3 

8.4 

68.89 

70.56 

69.06 

70.73 

69.22 

70.90 

69.39 

71.06 

69.56 

71.23 

69.72 

71.40 

69.89 

71.57 

70.06 

71.74 

70.22 

71.91 

70.39 

72.08 

8.6 

72.25 

72.42 

72.59 

72.76 

72.93 

73.10 

73.27 

73.44 

73.62 

73.79 

8.6 

8.7 

73.96 

75.69 

74.13 

75.86 

74.30 

76.91 

74.48 

76.21 

74.65 

76.39 

74.82 

76.56 

75.00 

76.74 

75.17 

76.91 

75.34 

77.08 

75.52 

77.26 

8.8 

8.9 

77.44 

79.21 

77.62 

79.39 

77.79 

79.57 

77.97 

79.74 

78.15 

79.92 

78.32 

80.10 

78.50 

80.28 

78.68 

80.46 

78.85 

80.64 

79.03 

80.82 

9.0 

81.00 

81.18 

81.36 

81.54 

81.72 

81.90 

82.08 

82.26 

82.45 

82.63 

9.1 

9.2 

82.81 

84.64 

82.99 

84.82 

83.17 

85.01 

83.36 

85.19 

83.54 

85.38 

83.72 

85.56 

83.91 

85.75 

84.09 

85.93 

84.27 

86.12 

84.46 

86.30 

9.3 

9.4 

86.49 

88.36 

86.68 

88.55 

86.86 

88.74 

87.05 

88.92 

87.24 

89.11 

87.42 

89.30 

87.61 

89.49 

87.80 

89.68 

87.98 

89.87 

88.17 

90.06 

9.6 

90.25 

90.44 

90.63 

90.82 

91.01 

91.20 

91.39 

91.58 

91.78 

91.97 

9.6 

9.7 

92.16 

9 - 1.09 

92.35 

94.28 

92.54 

94.48 

92.74 

94.67 

92.93 

94.87 

93.12 

95.06 

93.32 

95.26 

93.51 

95.45 

93.70 

95.65 

93.90 

95.84 

9.8 

9.9 

96.04 

9 S .01 

96.24 

98.21 

96.43 

98.41 

96.63 

98.60 

96.83 

98.80 

97.02 

99.00 

97.22 

99.20 

' 97.42 
99.40 

97.61 

99.60 

97.81 

99.80 

N 

g|Q||| 

■OH 

2 

3 

4 

5 

6 

7 

8 

9 
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Reprinted abridged from R. A. Fisher and F. Yates, Statistical Tables 



TABLE VI Percentile Values of r for n Degrees of Freedom 

when p =• 0 * 


n 

r. 95 

7.975 

r.99 

7.995 

7.9996 

n 

r.95 

7.976 

Tm 

7.995 

7.9996 

i 

.988 

.997 

.9995 

.9999 

1.000 

30 

.296 

.349 

.409 

.449 

.554 

2 

.900 

.950 

.980 

.990 

.999 

35 

.275 

.325 

.381 

.418 

.519 

3 

.805 

.878 

.934 

.959 

.991 

40 

.257 

.304 

.358 

.393 

.490 

4 

.729 

.811 

.882 

.917 

.974 

45 

.243 

.288 

.338 

.372 

.465 

5 

.669 

.754 

.833 

.874 

.951 


.231 

.273 

.322 

.354 

.443 

6 

.622 

.707 

.789 

.834 

.925 

55 

.220 

.261 

.307 

.338 

.424 

7 

.582 

.666 

.750 

.798 

.898 

60 

.211 

.250 

.295 

.325 

.408 

8 

.550 

.632 

.716 

.765 

.872 

65 

.203 

.240 

.284 

.312 

.393 

9 

.521 

.602 

.685 

.735 

.847 

70 

.195 

.232 

.274 

.302 

.380 

10 

.497 

.576 

.658 

.708 

.823 

75 

.189 

.224 

.264 

.292 

.368 

11 

.476 

.553 

.634 

.684 

.801 

80 

.183 

.217 

.256 

.283 

.357 

12 

.458 

.532 

.612 

.661 

.780 

85 

.178 

.211 

.249 

.275 

.347 

13 

.441 

.514 

.592 

.641 

.760 

90 

.173 

.205 

.242 

.267 

.338 

14 

.426 

.497 

.574 

.623 

.742 

95 

.168 

.200 

.236 

.260 

.329 

15 

.412 

.482 

.558 

.606 

.725 

100 

.164 

.195 

.230 

.254 

.321 

16 

.400 

.468 

.542 

.590 

.708 

125 

.147 

.174 

.206 

.228 

.288 

17 

.389 

.456 

.528 

.575 

.693 

150 

.134 

.159 

.189 

.208 

.264 

18 

.378 

.444 

.516 

.561 

.679 

175 

.124 

.148 

.174 

.194 

.248 

19 

.369 

.433 

.503 

.549 

.665 

200 

.116 

.138 

.164 

.181 

.235 

20 

.360 

.423 

.492 

.537 

.652 

300 

.095 

.113 

.134 

.148 

.188 

22 

.344 

.404 

.472 

.515 

.629 

500 

.074 

.088 

.104 

.115 

.148 

24 

.330 

.388 

.453 

.496 

.607 

1000 

.052 

.062 

.073 

.081 

.104 

25 

.323 

.381 

.445 

.487 

.597 

2000 

.037 

.044 

.016 

.058 

.074 


r~. os 

•“.026 

— 7.01 

“7.095 



nn 

S3 

“7.01 

—7.006 



* Reprinted abridged from R. A. Fisher and F. Yates, Statistical Tables , published 
by Oliver and Boyd Ltd. by permission of the authors and publishers. 
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TABLE VII 


1. /I + r *\ 

Values for Transforming r into z = ^ lo 9.11 _ r ^ 
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TABLE VIII Scores Made by Students in a First Course in Statistical 

Examinations in Subject Matter of the 


Prognostic Test Score Criterion Score 


Code 
number 
of student 

Section 

Reading 

Artifi¬ 

cial 

language 

Arith¬ 

metic 

test 

Mid¬ 

term 

test 

Final 

exam 

Semester 

grade 

1 

III 

33 

51 

33 

58 

65 

62 

2 

III 

39 

50 

36 

53 

51 

52 

3 

I 

38 

38 

39 

52 

53 

53 

4 

I ' 

23 

27 

29 

47 

42 

45 

*5 

I 

44 

40 

41 

61 

50 

56 

*6 

I 

45 

54 

38 

47 

53 

50 

7 

I 

48 

57 

42 

64 

64 

64 

8 

• 

II 

38 

39 

35 

64 

54 

59 

9 

II 

32 

46 

23 

58 

50 

54 

*10 

II 

28 

45 

.37 

44 

45 

45 

11 

II 

33 

53 

28 

62 

63 

63 

12 

II 

40 

43 

25 

52 

50 

51 

13 

II 

34 

55 

38 

50 

56 

58 

14 

II 

34 

57 

36 

71 

68 

70 

15 

II 

35 

46 

37 

62 

65 

64 

16 

I 

32 

43 

39 

52 

59 

56 

17 

II 

34 

44 

38 

59 

57 

58 

18 

II 

42 

51 

35 

53 

60 

57 

19 

I 

32 

55 

36 

62 

56 

59 

*20 

III 

24 

34 

17 

37 

43 

40 

21 

I 

39 

52 

34 

55 

53 

54 

22 

I 

34 

25 

29 

30 

42 

36 

23 

II 

44 

58 

35 

61 

56 

59 

24 

III 

27 

29 

26 

49 

40 

45 

25 

II 

34 

52 

30 

43 

38 

41 

26 

II 

40 

53 

29 

61 

54 

58 

27 

I 

37 

34 

26 

47 

57 

52 

28 

III 

22 

40 

33 

47 

50 

49 

29 

I 

30 

54 

40 

56 

47 

52 

*30 

I 

36 

58 

25 

52 

39 

45 

31 

I 

35 

43 

40 

49 

37 

43 

32 

I 

39 

58 

32 

49 

37 

43 

33 

II 

37 

28 

14 

43 

36 

40 

34 

II 

38 

51 

26 

40 

42 

41 

35 

II 

36 

55 

31 

59 

52 

56 

36 

I 

46 

52 

39 

61 

63 

62 

37 

II 

44 

57 

32 

46 

53 

50 

38 


40 

56 

32 

64 

49 

57 

39 

III 

38 

30 

18 

25 

38 

32 

40 

I 

50 

47 

32 

58 

52 

55 

41 

II 

16 

49 

26 

52 

36 

43 

42 

I 

46 

60 

43 

61 

57 

59 

43 

I 

42 

56 

40 

59 

54 

57 

*44 

III 

27 

53 

26 

62 

48 

55 

45 

III 

33 

55 

25 

55 

57 

56 

46 

II 

40 

54 

34 

52 

57 

55 

47 

I 

46 

53 

37 

62 

49 

56 

*48 

I 

46 

58 

39 

58 

56 

57 


* A woman student. 
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Methods on Three Prognostic Tests Given at Beginning of Term, Two 
Course, and the Semester Grade 


Code 
number 
of student 


Section 


*49 

*50 

*51 

*52 

53 

54 

55 

56 

57 

58 

59 
*60 

61 

62 

63 

*64 

*65 

66 

67 

*68 

69 

70 

71 

72 
*73 

74 

75 

76 

77 

78 
*79 

80 

*81 

*82 

*83 

84 

85 

86 
*87 
*88 

89 

*90 

*91 

*92 

93 

*94 

95 

*96 

97 

98 


III 

I 

I 

III 

II 
I 

I 

III 
III 
III 

II 

III 
III 
I 

III 

I 

III 

III 

III 

II 
II 
II 

I 

II 

I 

II 

III 
III 
I 

III 

I 

I 

I 

I 

I 

I 

III 

III 

I 

I 

III 

III 

I 

II 

III 
III 
I 

I 

I 


Prognostic Test Score 

Artifi- Arith- 

Rending cial metic 
language test 


Criterion Score 


Final Semester 

term exam grade 
test 


25 

26 
42 

34 
28 
21 

44 
38 
16 
40 
33 

35 

33 
30 
38 

36 

45 
32 
36 
21 

30 

34 
41 
44 
36 
58 
36 

31 
40 
44 
36 
44 
34 
38 
38 
38 
46 
30 
26 
36 
36 
32 

30 

31 
44 
14 
44 
30 
39 

32 


57 

45 

46 
60 
37 
24 
43 

47 
50 

48 

42 
59 

40 

41 
47 

43 

57 
32 

53 
46 

54 

49 

58 

55 
14 

50 
41 
14 
46 
60 
36 

51 
46 

56 

52 
51 
51 

42 
44 

58 

43 

59 
23 
58 
34 
16 
26 
58 
34 
56 


24 
18 
31 
33 

8 

16 

35 

28 

27 
35 

25 

25 

28 
30 
39 

26 

35 
21 
27 
39 
34 
27 
39 
38 
20 
38 
30 
18 

36 
43 
23 

36 

37 
36 
33 
33 
40 
22 
28 
42 
42 

36 
28 

31 

32 
18 
30 

37 

33 
30 


46 

47 
58 
62 
47 
44 
58 
58 
47 
56 
52 

52 

49 
56 
62 
61 

53 

50 

49 
44 
47 

50 
53 
47 
30 
68 
62 
27 
58 
71 

49 
53 

43 
64 

50 
50 
62 

37 

58 

59 
46 
61 

38 

44 
55 

40 
37 
61 

41 
65 


49 

25 

56 

64 

46 
41 

55 

56 

47 
38 

46 

49 

47 
59 
63 

50 
46 
41 
34 
33 
46 

44 
57 

46 
40 

55 

56 
30 
50 

45 

43 

47 
49 
61 
56 

48 

54 
33 

48 
69 

44 

55 
38 
51 

49 
33 
19 

56 
46 
51 


48 
36 
57 
63 
47 
43 
57 

57 
47 

47 

49 
51 

48 

58 
63 
56 

50 

46 
43 
39 

47 

48 
55 
47 
35 
62 

59 
29 
54 
58 
46 
50 
46 

63 
53 

49 
58 
34 
53 

64 
45 

58 
38 
48 
52 
37 
28 

59 
44 
58 


TABLE IX Scores for 109 Fourth-grade Pupils 
on the Modem School Achievement Test 


g — : -— 

Age to 
Nearest 
Month 



Score 

on Test of 


Case 

Number 

4 

I.Q. 

Arithmetic 

Computa¬ 

tion 

Arithmetic 

Reasoning 

Reading 

Speed 

Language 

History- 

Civics 

1* 

121 

99 

12 

11 

27 

17 

24 

2 

124 

83 

13 

4 

. 12 

15 

13 

3* 

103 

117 

5 

8 

30 


10 

4* 

127 

83 

8 

6 

30 

12 

21 

5 

115 

109 

7 

4 

26 

27 

13 

6 

108 

111 

6 

3 

17 

21 


7* 

106 

92 

9 

9 

25 

30 

12 

8* 

115 

95 

5 

3 

28 

12 

16 

9 

140 

79 

8 

3 

28 

23 

11 

10 

111 

115 

10 

8 

20 

26 

4 

11 

109 

120 

5 

5 

26 

19 

10 

12 

119 

101 

6 

4 

24 

14 

20 

13 

119 

87 

10 

5 

8 

14 

17 

14 

108 

106 

7 

7 

10 

16 

5 

15 

131 

100 

6 

5 

21 


8 

16* 

112 

120 

13 

7 

21 

• 

19 

8 

17 

118 

99 

4 

4 

26 

16 

10 

18 

96 

119 

7 

7 

20 

17 

12 

19 

148 

66 


2 

16 

11 

9 


132 

77 

5 

3 

27 

19 

21 

21* 1 

108 

124 

8 

6 

29 

29 

9 

22* 

96 

125 

10 

5 

30 

18 

16 

23 

118 

104 

9 

4 

18 

23 

21 

24 

123 

89 

6 

5 

13 

19 

13 

25* 

109 

97 

8 

5 

28 

26 

2 

26 

106 

126 

11 

11 

29 

30 

18 

27* 

142 

68 

5 

2 

7 

17 

10 

28 

109 

83 

6 

3 

16 

23 

15 

29 

150 

64 

6 

1 

11 

7 

7 

30 

113 

96 

4 

4 

25 

26 

19 

31 

117 

91 

6 

4 


19 

2 

32* 

156 

75 

12 

5 


18 

15 

33* 

119 

103 

8 

5 

14 

22 

18 

34* 

129 

86 

9 

4 

8 

14 

6 

35 

114 

118 

9 

3 

24 

19 

10 

36 

106 

140 

13 

9 

40 

31 

10 

37 

114 

120 

12 

9 

17 

21 

18 

38 

134 

93 

9 

3 

17 

24 

12 

39* 

109 

112 

7 

2 

31 

24 

18 

40 

115 

116 

15 

5 

41 

33 

22 

aa 


• All case numbers starred refer to boys, unstarred numbers to girls. 
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TABLE IX Scores on Achievement Test. Continued 
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TABLE IX Scores on Achievement Test. Concluded 


Case 

Number 

Age to 
Nearest 
Month 

I.Q. 

Score on Test of 

Arithmetic 

Computa¬ 

tion 

Arithmetic 

Reasoning 

Reading 

Speed 

Language 

History- 

Civics 


mm 

m 

7 

7 

24 . 

23 

2 

'ffiSP’vR - 7 

BRS 


6 

5 

40 

36 

17 

83* 

BTTjV 

104 

7 

3 

28 

29 

19 

84* 

B • B 

123 

10 

9 

42 

30 

18 

85 

109 

133 

8 

7 

31 

26 

9 

86 

114 

127 

9 

9 

28 

24 

7 

87 

107 

138 

3 

6 

46 

36 

4 

88 * 

112 

132 

4 

1 

44 

31 

12 

mm 

107 

136 

9 

2 

29 

32 

5 

90* 

113 

119 

3 

7 

30 

22 

9 

91 

115 

106 

9 

4 

24 

17 

3 

92* 

108 

124 

1 

5 

50 

29 

23 

93 

108 

124 

9 

4 

48 

14 

7 

94* 

122 

102 

6 

2 

26 

20 

5 

95* 

124 

83 

8 

3 

10 

15 

6 

96* 

126 

93 

7 

6 

16 

26 

14 

97 

104 

113 

8 

8 

30 

23 

8 


115 

90 

0 

2 

24 

18 

5 

99* 

117 

103 

8 

5 

27 

17 

5 

100 * 

111 

88 

12 

5 

10 

11 

6 

101 * 

115 

111 

9 

7 

16 

17 

10 

102 

103 

130 

2 

2 

41 

25 

2 

103 

122 

91 

4 

8 

32 

9 

5 

104* 

113 

106 

7 

3 

15 

10 

6 

105 

111 

115 

6 

2 

43 

28 

7 

106 

114 

120 

1 

3 

27 

26 

8 

107* 

118 

116 

11 

4 

40 

19 

5 

108 

109 

120 

9 

2 

50 

35 

4 

109 

118 

123 

10 

6 

41 

18 

4 


• All case numbers starred refer to boys, unstarred numbers to girls* 
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TABLE XII. Values of fc = Np Leading to Rejection of a Hypothesis 
about a Proportion at .05 Level of Significance 

N — Number of cases in sample 
k = Number of cases with given characteristic 
P = Proportion under hypothesis tested 

p = k/N 

A. Two Tails Region (.025 in each tail) 


For P g .5, reject hypothesis if observed k is less than or equal to left value in cell or is greater 
tlian or equal to right value. 

For P > .5, substitute 1 - P for P and N - k for k. 
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* There is no region of rejection in the indicated tail. 

• Increasing this number by 1 will not increase the lower tail to more than .026. 
b Decreasing this number by 1 will not increase the upper tail to more than .026. 
e Changing * to 0 will not increase the lower tail to more than .026. 
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(TABLE XII, Conf.) B. Upper Tail 
Region (.05 in upper tail) 

For P g .5, reject hypothesis if observed k is 
equal to or greater than cell value. 

For P > .5, substitute 1 - P for P and N 
for fc; use Table C. 



(TABLE XII, Cent.) C. Lower Tail 
Region (.05 in lower tail) 

For P ^ -5, reject hypothesis if observed k is 
equal to or less than cell value. 

For P > -5, substitute 1 - F for F and A 
for k ; use Table B. 

a 


20 , * 
Of '■ 


b Decreasing this number by 1 «U1 n0 * 
increase the upper tail to more than .051. 
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• No region of rejection is available, ^ 

• Increasing this number > 
increase the lower tail to more than 051. 
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GLOSSARY OF SYMBOLS 


The letters X and Y (and sometimes Z) are here customarily used to 
denote variables and observations on individuals. Constants are denoted 
by letters at the beginning of the alphabet. Small letters from the middle 
of the alphabet, especially i and j, are used as subscripts to indicate refer¬ 
ence to specific individuals or groups. 

Parameters are customarily denoted by Greek letters and sample 
statistics by English. Some exceptions are made, as in the use of P and Q 
to denote proportions in a population. 

A bar over a letter indicates the mean of a sample of observations on 
the variable denoted by the letter. 

Greek letters and English letters are listed separately according to their 
respective alphabets. Symbols of operation are listed separately. 

Each definition of a symbol is followed by the page number (in paren¬ 
theses) which shows the first reference to the symbol in the text. 

Symbols used only once for a specific purpose and lacking in general 
interest are not listed below. 

SYMBOL DEFINITION 

A (1) Arbitrary origin (93). 

(2) Alternative to a hypothesis (236). 

a Constant terms in a regression equation. The first 
a yx> a xy subscript indicates the variable to be predicted; 

the second indicates the variable which has been 
observed (127, 145). 

b Regression coefficients used as multipliers of known 
b yx , b ry values in regression equations to compute predicted 
values. For order of subscripts see a zy , a yx (127, 145). 

c (1) Confidence coefficient (220). 

(2) Subscript for combined groups (101). 

d Difference between two ranks assigned to the same 
individual (159). 

di Difference between the mean of the tth group and the 
mean of combined groups, X, - X c (101). 

e A constant having a value of approximately 2.718. 
It has many uses in mathematics but is used in this 
book only in the equation of the normal curve (184). 

/ Frequency in distributions (51). 
fx, f y Marginal frequencies of a bivariate distribution (125). 
fxv Frequency in a cell of a bivariate distribution (125). 

H Hypothesis (236). 

H: m = k Hypothesis that n — k; k a specific number (236), 
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SYMBOL 

l 

i, h 

I,J 

k 

log, 

logio 

N 

n 

P 

V 


DEFINITION 

Size of a class interval in scale units (96). 

Subscripts used to indicate reference to specific indi¬ 
viduals in a group. 

Shapes of curves (115). 

(1) Number of subgroups of a total group (101). 

(2) k = Np (253). 

Logarithm to base e (259). 

Logarithm to base 10 (260). 

Number of cases in a sample, sometimes used with a 
subscript to indicate a subsample (67). 

Number of degrees of freedom (222). 

The proportion of cases in one class of a dichotomous 

population (246). 

The proportion of cases in one class of a sample from 
a dichotomous population (245). 


Q 

Q\, Ql 

Qz, Qv 

Q 

R 


r 


Tgy.t 


r. o* 
8 

s 

Sy.i 

8 1 

Sx 


# 7 ,- 3 , 



1 -P (246). 

First quartile or lower quart ile (72). 

Third quartile or upper quartile (72). 

1 -p (245). 

(1) Rank order correlation (159). 

(2) Correlation derived from a computed correlation 
055, 169). 

Coefficient of correlation; subscripts may be used 
to indicate variables being correlated (142). 

Partial correlation between x and y, when variation 


z is eliminated (171). 

he fifth percentile of the distribution of r (259). 

ample standard deviation; may be used with sub- 

•ript to indicate the variable (90). 

ample standard deviation, used when needed to 

istinguish from small s (169). 

tandard error of estimate (132). 

r ariance, or square of standard deviation (90). 

,tandard error of a mean estimated from a sample 

Itandard error of a difference between means (233). 
(tandard error of difference between proportions 

n + 'lOr, where 2 is a scale value of the standard 
normal distribution (196). 
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SYMBOL 

t 


X , Y 
X, ? 
X, Y 


x, y 

x', y' 

y 

z 

z 


Zr 

z. OS 

f 

M 

7T 


P 

2 

a 

<T 2 

&x> °"p> ^pi-pi 

<r r , cr Zr 


a > b 
a < b 


a ^ b 


a ^ b 



DEFINITION 

A scale value of Student’s distribution or a variable 
which has this distribution (221). 

Variables, or gross scores of variables (87). 

Means of samples (88). 

Values predicted by regression equations (127). 

The 50th percentile; similarly for other percentiles 

(71). 

X - X, Y - Y (143). 

Coded scores (96). 

Ordinate of normal curve (184). 

50+10 {X - X)/s (196). 

Scale value of the standard normal distribution, or 
standard normal deviate. Also a variable which has 
this standard normal distribution (181). 
Transformation used in testing the significance of a 
correlation coefficient (259). 

The fifth percentile of the normal distribution (187). 
Population value corresponding to the statistic z r 
(260). 

Population mean (179, 208). 

Ratio of circumference of a circle to its diameter; 
approximately 3.1416. 

Correlation coefficient in a population (258). 

The sum of (87). 

Standard deviation in a population (179, 208). 
Variance in a population (208). 

Standard error of quantity noted as subscript (213, 
249, 255, 258, 2G0). 

Measure of relationship between two dichotomous 
variables (158). 

Is not equal to (234). 
a is greater than b (228). 
a is less than b (228). 

The absolute or numerical value of a, the sign being 
taken as positive (89). 
a is greater than or is equal to b (282). 
a is less than or is equal to b (282). 

The positive square root of a (90). 


co Infinity (271). 



list of formulas 
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PAGE 

FORMULA 

. , N{N +1) 

81 

(6.1) 

Sum of ranks - ^ 

(7.1) 

A N 

88 

(7.2) 

2(X - X) = 0 

2 1X - X | 

88 

89 

(7.3) 

Mean deviation from the mean ^ 


Z(X - X) ! 

89 

(7.4) 

Variance - ft - i 



NIX' - (XX)' 

90 

(7.5) 

Variance - - _ fj 


(7.6) 

, /six - X)* 

s = v variance - y \ 

90 

(7.7) 

T 2/X 

N 

92 


Nsrx' - (Xfxy 

92 

(7.8) 

Variance - N(N - \) 



_ . . 2(X- A) 

93 

(7.9) 

X-A+ N 



V2(X - AY - [2(Xj^D2 

Variance = W(V - 1) 

ix' = X - A 

*—•(¥) 

, fNlf( x')' - (Zfx'A 
Variance =* i* ^ V(V - 1) / 

(2X) 2 

2x 2 = 2(X - X) 2 = 2X 2 - -jf 

r (2/x'H 

Sx . = i*[2/(xo*-4rJ 

(2X) 2 


2X 2 - 


(7.16) 


s 



94 

9G 

96 

96 

97 
97 


97 
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97 

(7.17) 


N — \ J 

Raising the arbitrary origin by r intervals: 

97 

(7.18) 

changes 2x' to 2x' — Nr 

(7.19) 

and changes 2(x') 1 to 2(x') s — 2r2x' 4- Nr* 

97 

Lowering the arbitrary origin by r intervals: 


(7.20) 

changes 2x' to 2x' 4- Nr 

97 

(7.21) 

and changes 2(x') 2 to 2(x') 2 4- 2r2x' 4- Nr 1 

97 

(7.22) 


N c = AT, + AT, + • • • + N k 

101 

(7.23) 

x e = 

~j- (XiX, 4- N 2 ^2 4- • • • 4 - N Jtk) 

101 

(7.24) 


di — Xi — X c 

101 

(7.25) (N c 

- l)s* = (N 1 - 1)8*, + (N 2 - 1)8* + • • • + (N* - l)sf 



+ Nid[ + Ntdl 4- • • • 4- Nkdl 

101 

(9.1) 

• 

Y = a 4- bX 

122 

(9.2) 


N2XY- (2X)(2F) 
b NXX* - (XX)* 

127 

(9.3) 


a = Y — 6X 

127 

(9.4) 


H 

II 

1 1 

10 3' 

132 

(9.5) 2 (Y 

-iy-r 

f nx y* ffln* [" 2 xr-(sx)( 2 r)?l 

{ K ) AT2X*-(2X)* J 

132 

(9.6) 

2(F 

- F)* = 2(F - f)* 4- 2(F - F)* 

133 

(9.7) 


? * = a + &X* 

135 

(9.8) 


^ vertical segment _ v 

horizontal segment h 

135 

(10.1a) 


2 

2(F - F) 1 

142 

(10.1b) 


2(X - X)* 

2(X - X)* 

142 

(10.2a) 


2(X — X)(F — F) 
v^2(X - X)*2(F - F)* 

143 



2xy 

f = —---- 

143 

(10.26) 


V2x*2y 2 
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rrzxY - C£X)(2Y) 

143 

(10.3a) 




NZf.jc'v' - (SW)CZLy') 

143 

(10.3b) 

r V] NEf.(x'y - (2/^OM 1- (2/.!/ VI 



,S, 

144 

(10.4a) 

r = b- 

Sy 



. s y 

144 

(10.4b) 

b = r 7. 


(10.5a) 

Y = Cly X + by t X 

145 

145 

(10.5b) 

X - a xv + b Jv r 



Arsxr- (2X)(sr) 

146 

(10.6a) 

bl,x " N2X 2 - (2X) 5 


• 

v2xy-(2X)(2n 

147 

(10.6b) 

biv " N2K l -(2K) 2 


(10.7) 

a„ = 7 - k,.x a" 11 °» = X - i, ' ,r 

147 



148 

(10.8) 

**“ ’'V jv(x -1) 



. . /NB - 

148 

(10.9) 

S » = , «V N{N~l) 



nd-ac 


148 


(10.10) 

r V(NE - C*){NB — A?) 



v iy ND - AC 

148 

(10.11) 

~ i z ' NE - C l 



i t ND-AC 

148 

(10.12) 

hzv ' Ty ' NB - A 2 



2r 

155 

(10.13) 

R ' \ + r 



l 

cs 

158 

(10.14) 

+ ~ V{a + md + c){b + d){a + c) 



02rP 

159 

(10.15) 

^ = 1 N{N 2 - 1) 


(11.1) ' = 

(2A*.)(SJ*,) (?.Y,)(21> 

«\<r If ' --— k • 

) (2.Y ,1(2 VO 


IXiYx + Z XiY* + 2X,Y* - 

I , Czxl)* (2**)* 

N/2XI+—ar 


. r ■ (w>.r i- 1 ? 

.g.V.?Jv, 1+ v, 1+ v P> .» A>1 - v . 

/Vj 


(2T,V 
A j 


107 
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(13.1) 

Vn 

213 

(13.2) 

x-x 

z -- 

s 

217 

(13.3) 

X-fM 

z <T 

217 

(13.4) 

X-M 

2 (»/vTv) 

217 

(13.5) 

(S/VN) 

218 

(13.6) 


220 

(13.7) 

. X-M . 

V (w'.v) < i? 

221 

(13.8) 

X + z 1 _ e < M < X + z.+. v77 

221 

(13.9) 

x + ^vV and X + W* 

224 

(14.1) 

sx.-*. = \/w + N, 

233 

(14.2) 

(X, — X 2 ) — (Mi “ Ms) 

2 i/4 + 4 

\ N 2 

233 
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246 
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(15 1) Mean °* a dichotomous population r 

(15.2) Variance of a dichotomous population = P(1 - P) 

(15.3) Standard deviation . _ y /p ( . __ m 246 

v ’ of a dichotomous population - ) 

A i A 

/jrM ean of p = p P = P 

2 F(l-P) 

• _ .i M * — 

(15.5) 


(15.6) 


Variance of p = ^ 

ard deviation of p = o"p = \/ 


249 

249 


249 


(15.10) Region of rejection for lower-tailed test is 



/pO- 

/ V 

_p) 

249 

2^ + 

/pd- 
/ V 

_p) 

249 

test is 




1 . 


-P) 

251 

+ 2W + Z “ 

V v 

1 test is 




-^ + 2 “ 

t/® 

-P) 

V 

251 


• • 


(15.11) Two-tailed region of rejection is 

1 . fi76 y / ^IZl>andp<F'2]V + Z .o«V 


p > P + + 2 


P(1 - P) 

z.ks V 


2V 


(15.12) 


(15.13) 


'p.-p.^V Vi 


3p,-pi 


pd - v) 

—nT + 

N 



251 


255 


255 


(15.14) 


pi - p 2 - 2 V,V 2 

z = i /pu~- P } + 

V ivr" + 

v_ 

p 2 - p2 - 2V..V 


255 


255 


(15.15) 


2 = 


p(i - P> + 


V v, 




V 


292 

FORMULA 

(16.1) 

(16.2) 
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(16.4) 
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(16.6) 

(16.7) 

(16.8) 
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<T r 
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1-P* 

' Vn- 1 

rVN- 2 

vT^T 3 
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z r = 1.1503 logio | 

z r = 1.1503 [logic (1 + r) - logio (1 - r)] 

°‘ r= vih% 

Upper limit = z r + Z i+ e ~ 3 

2 

Lower limit = z r 4- Z i- e — 3 

2 

c* - **) - (ri - r*) 

V ATj - 3 + iV, - 3 
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^ r yt ) y 2(1 _ ^ _ t ,' _ ^ + 2r xv r„r t t 
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260 

260 
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263 
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ANSWERS 


Exercise 3.2, M 

a (a) 20 8 years; 28.5 years; 24.9 years; ^1920 itwas slightlyhigher for 

2 * (&) ^whites! (b) Not if all races ^crepancy has increased 

nonwhite males than f° r , . D t~ * (dl Sfpancy has decreased foi: both.males 
for both whites an ■>“ wh ^ 8 - $ WlVw; 1920-M; 1910-M; WW 

(2) 1910-20; 1910-20 $ ' are not constant throushcut the table. 

increase was 2.4 years, ivnv , ,, a _ fi 

3. (a) 0 + 69.6 = 69.6; 20 + 52.4 = 72.4; 70 + • 

(b) No. (c) Ages 65 and 70; age 70 


4. 

16 

33 


35 

14 


51 

47 


49 


Exercise 5.1, poflt 55 


(a) 0.5-7.5; 1 <b) 6-8; 2 (c) 6.75-7.25; u.n ta,- 

, M So, 20.1, 20.2,203 (c) 17.0, 20.0, 23.0, 26.0, 29.0 
(d) 19.99, 20.00, 20.01, 20.02, 20.03 

Exercise 5.2, pofl« 57 
Real limits Score limits 


1. (b) Scores in interval 

9.5, 10.0, 10.5 
8.0, 8.5, 9.0 

6.5, 7.0, 7.5 

3. (a) Score limits 
17-19 
14-16 

(b) Score limits 

30-39 
20-29 
10-19 

(c) 70-76 
63-69 
56-62 


9.25- 10.75 
7.75- 9.25 

6.25- 7.75 

Scores in interval 

17, 18, 19 
14, 15, 16 . 

Real limits Claw index 

29.5- 39.5 

19.5- 29.5 
9.5-19.5 

69.5- 76.5 

62.5- 69.5 

55.5- 62.5 


34.5 

24.5 

14.5 
73 
66 
59 


9.5- 10.5 
8.0- 9.0 

6.5- 7.5 

Real limits 

16.5- 19.5 

13.5- 16.5 
• 

t 

10 

10 

10 

7 3 

7 

7 


Class index 
10.0 
8.5 
7.0 

Class index 
18 
15 


i 

1.5 

1.5 

1.5 

i 

3 

3 


35-37 and 38-40; * = 3 


Exercise 6.1, p«fl* 7e 

3. No; no. 


4. ill; I; I; l ’> 11 

2. (a) Not possibly, (b) No. 

5 . 63.1; a score; above, not in. Q,; in combined 

a. In I, nearer to Qt) in II, a little nearer to Q„ 

group, a little nearer to Qt. n. A stands higher. 

,.B.D.A.C 10. C, A, B, E, D 


*. B. D, A, c " c or oc M CF (0 Area CFJM 

(g) Area to right of GI or area GH l ■ B is the tenth percentile or A io 

is fal Vertical (b) A is the fifth percentile or X.«, B x v . W)is X.«rX., 0 

8 §&§ ^ oi the d '“ tnbul, ° i - 

Ex •relit 6.2, pofle »1 


1, Score: 25 21 18 ^ 2 g 

Rank: Sum of tanks - 28 - 7(8)/2 

1. Score; 6 6 *o 61 *1 

RSDk: IL of rank. - 45 1 9(10)/2 


21 

4 


21 

4 


8 

5 


12 

3 


5 

8 


2 

9 
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3. Score: 
Rank: 


4. Score: 
Rank: 


12 8 7 10 6 

3 64 9 4 11 

Sum of ranks = 78 = 12(13)/2 


7 

9 


5 4 7 9 8 5 

10 12 7 3* 54 10 

Sum of ranks = 105 = 14(15)/2 


3 

13 


9 

5 


5 

10 


7 13 
9 2 


9 12 

3* 2 


8 

6i 


8 

5 * 


14 

1 


2 

14 


5 

12 


13 6 

1 8 


Exercise 6.3, pcfl* 83 

1. J5840 is the 57th percentile. 57 is the percentile rank of $5840. 

2. 12 is the 87th percentile. 87 is the percentile rank of 12. 

3. The 50th percentile is 72.0. The percentile rank of 72.0 is 50. 

4. The 95th percentile is $6850. The percentile rank of $6850 is 95. 


Exercise 7.1, page 88 

1. X - 5.7; X. M = 5.5 2. X = 16; X.so = 18 


3. X — 409; X.u — 409.5 


Exercise 7.2, page 92 



Percentile rank of 10.2 is 79 because 


17 + .7(4) 
25 


.792. 



Percentile rank of 5.4 


is 19 because 


2 + .9(3) 
25 



6. and 7. 


X 

(a) 

9 

9 

(b) 

26 

25.4 

(c) 

10.5 

9.1 

(d) 

1.2 

3.5 

(e) 

2.0 

1.0 

(f) 

-4.0 

-2.0 


8. The standard deviation can 


8 s 

3 

Range/* 

15.8 

3.97 

3.3 

5.8 

2.41 

2.5 

27.6 

5.26 

2.7 

18.9 

4.35 

2.5 

50.5 

7.11 

2.4 

40.7 

6.38 

2.8 


larger than the mean. 



3.33 





Exercise 7.3, page 99 


1. 

For ungrouped data 

i 

II III 

Combined group 



X 

55.0 

52.8 48.2 

52.4 



s' 

68.3 

80.1 95.4 

86.1 



s 

8.3 

8.9 9.8 

9.3 


For grouped data 






X 

54.9 

52.7 48.1 

52.3 



s' 

67.5 

78.3 95.1 

85.1 



8 

8.2 

8.8 9.8 

9.2 

3. 

Yes. 





4. 


Grouped 

Ungrouped 

5. III-II-I 


I 

-1 

.0 

-2.6 

III-II-I 


11 

— 

.2 

.4 



III 

— 

.1 

-.8 



Combined group 

— 

.8 

-.1 


6. 

Section 

Grouped 

Ungrouped 



I 

44/8.2 

= 5.4 

41/8.3 = 4.9 



II 

44/8.8 

= 5.0 

41/8.9 = 4.6 



in 

39/9.8 

= 4.0 

37/9.8 = 3.8 



Combined 

49/9.2 

= 5.3 

46/9.3 = 4.9 



M 

» 

X.ro < X.40 

< X ,i0 

P 

» 



p 

(b) 



M 

. 0) 2(X - 

11) = N + 2(X 

- 12) 

M 

.(c) 

M 

* 

i 

II 

= N + 2(X - 6) 

P 

. (i) 



.u 

(d) 

2(X - X) 

- 0 

M 

(k) 2(X - 

X) 1 is smaller 

than 

p 

M 

.(c) 

.(0 

69 - 3 = g 

;.9 which is unrea¬ 

P 

P 

Z(X 

(m) 

_(n) 

— A)' when A * 

5 


7.4 

sonably large. 




P 

_ (6) 









ANSWERS 




1. Rang® 

Below X -i 

X-«toX 

X to X + a 
Above A + * 

Below X — 2a 
X — 2a to A 
X to X + 2a 
Above X + 2a 

Jones 

66 
52 
51 
66 
59 


ExarcUa 7.4, po B * 105 

Without interpolation 

Number 
11 
38 
32 
17 


2 . 


A 

a 

c 

D 

Mean 


4 

45 

49 

0 

Smith 

53 
48 
57 
57 

54 


Percent 

11.2 

38.8 
32.7 
17.3 

4.1 

45.9 
50.0 

0.0 


With interpolation 
Number 


13.1 
35.9 
34.7 
14.3 

4.0 

45.0 

48.2 

.8 


Percent 

13.4 

36.6 

35.4 

14.6 

4.1 
45.9 
49.2 
.8 


Brown 

41 

53 

52 

45 

48 


English usage 
Algebra 
U.S. history 


(X - X)/s 

-.95 

.57 

1.28 




31.6 + .68(25) 
31.6 + .66(39) 


4. r 


ExarcUa 9.1, P«B* I 40 

AO 1 v - ? = 52 - 48.1 
57.3 r-t =«-57.3 


_ 11,500 . § = .11 
b * x m 177,400 3 


3.9 

- 13.3 


_ 11,500 .3 =03 
' 212,875 5 


4. No. *• 7 


6 . 6 


1. No. 


11.500 __ 

V(lTM00)(212^75) 

ExarcUa 10.2. pofl« 153 

2 were in both groups. 3- No - 

ExarcWa 10.3, P«B« ' 55 
, 2(.52) _ M . not necessarily; something not far from -5- 
| 52^ 1 

a . m. - .62. The result is less likely to be achieved. 

l ' 46 '“l proportionately more than V- 

a All decreased; s. proportionally more than „ ^ more than ... 

,. AU decreased; .. proportionately more *■; £ > M more ... 

4. All increased; ^ ^ 5., proportmnately more ... 

5. All increased; a, proportionately m 

ExarcUa 11A P"«* 172 

—■ *n f,;S 

A.>.f J $ 6. .46 7' 7486 D. £ JWjj S X 

iss tss SSR — 


A. 1. .07 2. .96 

B. 1. .01 2. .95 

C. z.m, t. o, 2 «- 0 2 “Aq 40 

D. 1. .98 2. .99 

F. 1. .02 2. 01 


ExarcUa 12.2, pofl* l 8 * 

3 55 4. .31 5 - 42 

3 ; ; 60 4. .005 


• — 
5. .05 


3. .50 
3. 05 


4. .06 
4. .04 


5. .95 
5. .10 


fi. .11 
6 . .20 
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H. 1. £. 90 * and z.nt 
5. z.ot and z. H 

I. 1. 53.7 

5. 58.6 and 69.4 

J. 1. 71.8 

5. 80.9 and 101.1 

K. 1. 43.6 

5. 50.9 and 67.1 

L. 1. 0.9 

5. 3.3 and 8.7 


7. 


2. z.oi and z.n 
6. z. io and z.n 
2. 68.2 

6. 48.3 and 79.7 
2. 98.9 

6. 61.6 and 120.4 
2. 65.3 

6. 35.5 and 82.5 

2 . 8.1 

6. —1.8 and 13.8 


• *.** 


z.nt and z.tu 

3. 23 percent 
7. 77.2 
3. 92 percent 
7. 115.7 
3. 18 percent 
7. 78.7 
3. 0 percent 
7. 12.6 


8. Z .MOi 


’•QWv uuvi 

4. 4 percent 
8. 43.4 and 84.6 
4. 0.3 percent 
8. 52.4 and 129.6 
4. 23 percent 
8. 28.1 and 89.9 
4. 100 percent 
8. —4.3 and 16.3 


Exercise 12.4, poge 197 

2. T scores are 29, 38, 45, 52, 59, 66. 

Exercise 13.2, pog* 214 



Distribution of H 






Range of middle 

Range of middle 


Mean 

Standard 

95 percent 

50 percent 


error 

of sample means 

of sample means 

2. 

60 

1.5 

57.1 to 62.9 

59.0 to 61.0 

3. 

40 

2.0 

36.1 to 43.9 

38.7 to 41.3 

4. 

40 

3.0 

34.1 to 45.9 

38.0 to 42.0 

5. 

92 

.8 

90.4 to 93.6 

91.5 to 92.5 

6. 

53 

2.5 

48.1 to 57.9 

51.3 to 54.7 

7. 

39 

.5 

38.0 to 40.0 

38.7 to 39.3 

8. 

39 

1.25 

36.6 to 41.4 

38.2 to 39.8 


Exercise 13.3, pogt 218 


1. 

(a) 

.50 (b) .07 

(c) .84 (d) .68 2. (a) 

.98 (b) 

.87 

4. 

(a) 

64.7 and 85.3 


(b) 67.3 and 82.7 

(c) 

69.8 and 80.2 


(d) 

71.9 and 78.1 


(e) 73.5 and 76.5 

(0 

74.0 and 76.0 

5. 

(a) 

74.4 to 75.6 


(b) 73.6 to 76.4 

(c) 

73.3 to 76.7 


(d) 

72.9 to 77.1 


(e) 72.2 to 77.8 

(0 

71.7 to 78.3 





Exercise 13.4, pegs 222 



2. 

(a) 

(1) 1.83 (2) - 

1.38 

(3) -.88 (4) -2.82 




(b) 

(1) 1-75 (2) - 

1.34 

(3) -.87 (4) -2.60 




(c) 

(1) -.72 and .72 


(2) -1.44 and 1.44 

(3) - 

1.94 and 1 M 



(4) -2.45 and 2.45 

(5) -3.71 and 3.71 


3. 

(1) 

— .69 and .69 


(2) -1.33 and 1.33 

(3) - 

1.73 and 1.73 


(4) 

-2.09 and 2.09 


(5) -2.86 and 2.86 


4. 

95 percent. Answer depends 

only on N. 







Exercise 13.5, page 225 



1. 

(a) 

14.65 and 32.35 


(b) 17.11 and 29.89 

(c) 18. 

25 and 28.75. 

2. 

(a) 

22.0 and 25.0 


(b) 22.3 and 24.7 

(c) 22. 

5 and 24.5. 

3. 

(a) 

20.6 and 26.4 


(b) 21.4 and 25.6. 







Exercise 14.1, page 234 



1. 

l = 

• 16 Cm = 

2.26 

not significant 



2. 

l = 

6.81 Cw = 

2.43 

significant 



3. 

t = 

1.96 t.„ = 

2.43 

not significant 



4. 

t = 

.186 t. n = 

2.43 

not significant 



5. 

l = 

.72 Cm = 

2.03 

not significant 




Exercise 15.1, page 256 

1 . For normal approximation the region of rejection is p > .829, and the hypothesis 
that .tear and hearing are alike is rejected. Hypothesis is also rejected by Table XIIB. 

2 . Consistent. 3 . z = 1.45, not significant at .05 level. 



INDEX 


Abscissa, 60 
Absolute value, 89 
Acceptance 

of hypothesis, 230 
region of, 241 
Action, choice of, 5 . 

Alternative to a hypothesis, 228, 23b 
as related to power, 238 
Angular measure, 40 
Arbitrary origin, 92 
in coded scores, 96 

in computation of 

correlation coefficient, 147 
mean, 93, 95 

regression coefficient, 147^ 
standard deviation, 93, 95 
Arithmetic mean, 87, see also Mean 
Arithmetic scale, 45 
Artificial population, 209 
Attenuation of correlation, 162 
Average, 107 

affected by modifying scores, lib 
qualities desirable in, 108 
Axis of coordinates, 42 

Background traits, controlling effects of, 
227 

Bar graph, 36 
Basic statistics, 6 
Bernoulli, J., 200 
Bessel, F. W., 189 
Best fit, line of, 125 

Bias , . OQ1 

in comparison of two populations, 23 

in estimation of parameter, 215 

Bimodal distribution, 115 

data illustrating, 116 

graphic illustration, 115 
Bivariate 

frequency distribution, 123 
population, 257 
symbols for frequencies, 125 
Block of table, 29 
Bowditch, H. P., 170 
Brace heading, 22 
Brown, W., 155 

Card 

edge marking, 16 
edge punching, 17 
punched,15 
Categories, 9 
ordered, 10 
scaling, 193 
Cell of table, 31 
Centile, 73, see also Percentiles 
Central position, measures of, 107 
Checking computation of 

correlation coefficient, 129, 146 
mean, 97 

regression coefficient, 129, 146 
standard deviation, 97 
Class index, 56 


Class interval, 50 

choice of, 57 -,40 

in correlation and regression, 148 

designation, 56 
limits of, 56 

number in frequency distribution, 08 
width or size of, 55, 148 
Class mark, 56 
Class sort, 56 
Classes, 9 
ordered, 10 
scaled, 10 
Coded scores, 96 
Coefficient of 
correlation, 141 
regression, 128, 135 
variation, 99 
Column 
chart, 38 
heading, 22 
of table, 22 

Combining groups in computation 
of correlation coefficient, 16/ 
of mean and standard deviation, 1 U 1 

Comparing groups bv 
frequency polygons, bl 
means, 98 
ogives, 77 

standard deviations, 98 
Comparing individuals by 
profiles, 197 
ranks, 80 

standard scores, 102 
Comparing populations as to 
correlations, 261 

"'Xn cases are paired,. JW 
when cases are not paired, 231--34 

proportions, 2;>4 
Conditional distribution, 130 
mean of, 131 . 

standard deviation ot. 13/ 
Confidence 

coefficient, 220 

interval, 219, 249, 261 
limits, 220 

Constant, 9 . 

Continuity, correction for, 2o5 
Continuous variable, it), jo 
Coordinates, 00 
Correlation 
chart, 124 

in double dichotomy, 158 
multiple, 172 
partial, 171 
phi coefficient, 158 
among ranks, 158 

n ‘ ,i0 ’ ^ 171 
zero order, 1 /1 . 

Correlation coefficient 

attenuation, 162 

circumstances affecting 

composite group, 160 

297 
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Correlation coefficient, circumstances affect¬ 
ing (coni.) 

errors of measurement, loJ 
grouping scores, 162 
linearity of regression, 163 
number of cases, 163 
range. 168 

selection of cases, 164 
units, 161 

variation in third trait, 170 
computation 

by machine, 144 
from scatter diagram, 148 
confidence interval, 261 
formulas, 142-144 
eampling distribution, 258 
size of, 263 
standard error, 258 
test of hypothesis 
p = 0, 258, 259 

two correlations are equal, 261, 262 
Zr transformation, 259, 273 
Critical region, 231 
Cross-section paper, 42 
Crude mode, 107 
Cumulative 

frequency distribution, 58 
percent curve, 76 
Curvilinear regression, 138, 163 

Decile, 73 
Decision 

as purpose of statistics, 5 
in test of hypothesis, 229, 236 
Degree of algebraic term, 121 
Degree of precision in measurement, 56 
Degrees of freedom in 
table of Student's distribution, 271 
test for correlation coefficient, 259 
test for mean, 222 
De Moivre, A., 200 
Dependent variable, 122 
Deviate, standard normal, 181 
Deviation from 

line of regression, 125 
mean of group, 88 
moan of normal distribution, 181 
standard, sec Standard deviation 
Dichotomy, 9, 240 
Difference between (test of) 

correlation coefficients, 201, 202 
means, 233 
proportions, 254 
Differences, population of, 228 
Discrete variable, 10, 53 
Dispersion, 73 
Distribution 
of data 

bimodal, 115 
bivariate, 123 
conditional, 130 
frequency, 50 
kurtosis in, 113 
peakedness of, 113 
skewness of, 113 
symmetry of, 113 
variability of, 73, 88, 111 
extreme case in, 1GS 
population, 203 

bivariate normal, 257 
dichotomy, 204 , 240 


normal, 178, 179 
probability 
normal, 178 
normal tables, 265, 267 
Student’s or t, 221 
Student’s or t table, 271 
sampling, 211 

of correlation coefficient, 258 
under hypothesis, 242 
of mean, 211 
of proportion, 247 
Student’s or I, 221 
true, 242 
spread of, 73 

Distribution-free test, 253 
Division points of scale, 60 
Dodecile, 73 
Durost, W. N., 115, 262 

Edge marking card, 16 
Edge punched card, 17 
Effectiveness of test of hypothesis, 235 
power of test, 236 
Efficient estimate, 216 
Error . 

in acceptance and rejection of hypothesis, 
235 

curve of, 200 

of estimation in regression, 133 
of first and second kind, 235 
law of, 201 
of meiisurement, 162 
probability of, 211 
probable, 189 
of random sampling, 209 
residual, 128 
risk of, 209 

standard, see Standard error and Standard 
error of estimate 
Type I and Type II, 235 
Estimate, standard error of, 133 
Estimated value in regression equation, 125 
Estimation 
efficient, 216 
interval, 219, 249, 201 
of parameter by statistic, 208, 211 
point, or single value, 214, 249, 258 
unbiased, 215 
Exact relationship, 120 
Extreme case in distribution, 168 

Falsity of hypothesis, 235 

and probability of rejection, 23G-243 
Field of a table, 31 
First-order partial correlation, 171 
Fisher, R. A., 20'.), 271, 272, 273 
Fixed-samplc-size test, 230 
Flanagan, J. O., 1 GO 
Footnote of a table, 28, 31 
Formulas, List of, 287 
Frequency, 22, 50 
density, 03 
marginal, 125 

represented by area, 01, 09, 79 
symbolism fur, 51, 125 
Frequency distribution, see Distribution of 
data 

Frequency polygon, 02 

Gallon, F., 119, 137, 170 
Gaussian curve, 201 
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Generalization from sample, 179, 202 
Geometric mean, 107 
Gossett, W. S., 222 
Graph types 
bar, 36 
column, 39 

cumulative frequency, 63 
histogram, 60 

line, 41 

pictograph, 47 
pie, 

scatter diagram, 124 
semilogarithmic, 43 

of various sires, 

149-153 

curvilinear relationships, 138 _ 

frequency distributions of different forms, 

112,114,115 

line of regression, 128, 131 

normal ’distribution, 177, 181, 183 
power functions, 238 
pro6les, 198 
proportions 

population, 247 
sampling distribution, 218 
regression coefficient, 136 
various statistics, 118 
Greek letters for parameters, 208 

Grouped frequency distribution, 69 

Grouping scores, 55 
effect on value of 

correlation coefficient, lo^ 
mean and median, 108 

Harmonic mean, 107 
Heading 
of table, 28 
of table column, 29 
Hcadnote, 29 

Histogram, 60 . , l77 94 o 

normal curve superimposed, 

Horizontal 
axis, 42 
scale, 42 
Huff, D., 38 

“SrcEn coefficient, 258, 259 
difference between two 
correlations, 261, 262 

means, 233 
proportions, 254 
mean, 228 
proportion, 250 
Hypothesis 

acceptance of, 230 

error of the second kind, zoo 

alternative to, 228, 236 
null, 228 

power of test of, 236 
rejection of, 229 

error of the hist kind, 230 . 

test of, 228, sec also Test of hypothc- 

Hypothetical distribution, 2*2 
• 

I-sha|>eA distribution, 115 
lndeiHindent variable, 122 
Inexact relationship, 123 
Inference, statistical, -08 


Inflection, point of, 116 

Institute of Life Insurance, 32 
Instruments for observations, u 
Interpercentile range, 75 
Interquartile range, 7o 
Interval, see Class interval 
Interval estimate, 219, 249, 261, .« also 
Confidence interval 

J-shaped distribution, 115 

Joint variation of two variables, Zoi 

Kelley. T. L„ 267 
Kurtosis, 113, 114 

Lander, B., 171 

1 -arge numbers, law ot, /Uo 

Law of error, 201 

Law of large numbers, 208 

Least squares, 127 

Leptokurtic distribution, 113 

Lester, A. M., 17 
lev, J., 89. 166, 209, 227 

Level of significance, -30 

test of hypothesis, 236 
relation to power of test, 237 
relation to Type I error, 23o 
Limits 

of class interval 06 
of confidence interval, —0 
Line caption, 22 
Line graph, 41 
Line of regression, l2o 
Linear relationship, 121 
Linearity, assumption o 136 
effect on correlation, 163 
List, table as, 20 
Logarithmic scale 45 

Long, J. A., 227.232 

Lower quartuc, < - 

Marginal frequency. 125 
Matched pairs, 2-7 

McCall. W. A196 
Mean. 87, 1*9.208 
arithmetic, 8 / 
comparison of two, -33 
computation from 
arbitrary origin, 92 
combined groups, 101 
grouped scores, .*4 

of dichotomous jiopulatioi , - 

difference between two, -3. 
effect of modifying scores, 117 

estimation 

by interval, 2-0 

by single value, 214 
geometric, 107 
harmonic, 107 
hypothesis about, 2 -H 
of normal .livlnbution. 179 
of population, 
nroiHirtion as, -b> 

Lf sampling distribution 

of mean, 211 
of pro|H»rtion. 2-1J 

standard error of, -13 

Mean deviation, 8.1 

Measurement, error*, of, 16- 


Median, 66 

computation from 
grouped data, 68 
scattered scores, 73 
efficiency of, in sampling, 216 
standard error of, 216 
Mesokurtic distribution, 113 
Mode, 107 
Moses, L., 80 

Multiple correlation and regression, 172 
Multiple decision, 236 

Nonlinear regression, 138, 163 
Non-normal distribution, 253 
of correlation coefficient, 258 
of proportion, 247 
Student’s, 221 
Nonparametric tests, 253 
Normal bivariate distribution, 257 
Normal curve, 176 
area under, 184, 265 
drawing the curve, 180 
equation of, 184 
family of, 182 
ordinate of, 185, 265 
parameters of, 184 
relation to histogram, 177 
Normal distribution, 176 
as approximation to sampling distribu¬ 
tion of 

correlation coefficient, 258 
mean, 218 
proportion, 248 
classifying data by, 191 
mean of, 179 
parameters of, 184 
percentiles of, 187 
table, 267 
profiles by, 197 

scaling ordered categories by, 193 
stanine scale, 192 
standard, 184 
standard deviation of, 179 
tables of, 265, 267 
transformation to T scores by, 196 
transforming ranks to scores by, 199 
Normality, assumption of, 253 
Number of cases 

effect on value of correlation coefficient, 
163 

in law of large numbers, 208 
relation to power of a test, 237-243 
in standard error of a statistic, 213, 249, 
258 

Number of table, 21 

Octile, 73 
Ogive, 76 

One-sided and two-sided tests, 231 
power of, 237 
Open interval, 50 
Ordered categories, 10 

scaling by normal distribution, 193 
Ordered variables, 10 
Ordinate, 60 

of normal curve, 185, 265 

Parameters, 184 
correlation coefficient, 258 
mean, 181 
proportion, 246 
standard deviation, 184 


use of Greek letters for, 208 
variance, 208 
Parent population, 207 
Partial correlation, 171 
Partition of sum of squares, 133 
Peaked distribution, 113 
Pearse, G. E., 116 
Pearson, K., 90, 137, 201 
Pease, K., 129 
Percentile rank, 82 
computation from 
graph, 83 
grouped data, 83 
Percentiles, 71 
computation from 
graph, 76 
grouped data, 72 
scattered scores, 73 
of distribution of r, 259 
Table VI, 272 
of normal distribution, 187 
Table II, 267 

relation to percentile ranks, 83 
of Student’s distribution, 222 
Table V, 271 
symbolism for, 71 
Percents 
in graph, 37 
in table, 24 

Personnel selection, 156 
validity of tests in, 157 
Phi coefficient, 158 
Pictograph, 47 
Pie chart, 40 

Platykurtic distribution, 113 
Point estimate, 214 
Population, 203 
bivariate normal, 257 
dichotomous, 246 
of differences, 228 
non-normal, 253 
normal, 204, 221 
parent, 207 
two-class, 203 
univariate, 257 
Position, measures of, 106 
Power 

function, 238 
of test of hypothesis, 236 
Precision 

of measurement, 54 
of regression estimate, 142 
Prediction 

expectation of life, 33 
in regression, 119 
Probable error, 189 

Probability, see also Distribution, prob¬ 
ability 

in normal distribution, 190 
paper, 199 

of rejecting a hypothesis, 236 
that an interval contains a parameter, 219 
Profiles, 197 
Proportion, 244 

confidence interval for, 249 
difference between two, 254 
estimation bv a single value, 249 
hypothesis about, 250 
sampling distribution of, 247 
table for testing hypothesis, 253, 282 
Punched card, 15 
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“Q-sort192 
Quantile, 73 
Quartile, 72 
Quetelet, A., 180, 201 

Random numbers, 206 
table of, 280 
Random sample 
simple, 207 
stratified, 207 

Random sampling errors, AH 
Random selection, 205 
Range, 74 

SSESTSS^ delation 98 
restriction of, in test validity, 169 
Rank order, 158 
correlation, 159 

Ranks . 

changing scores to, 81 

percentile, 82 

transformation to scores, 199 

table, 279 , 

Real limits of class interval, 56 
Reciprocals, table of, 2b8 
Records of data, 12, 14 
Region 

of acceptance, 241 

critical, 231 , O oi 

of rejection or significance, 231 

choice of, 240 
Regression, 119 
application oi, los 
coefficient of, 128, 135 
computation by 
machine. 129 
scatter diagram, 148 
conditional distribution, 130 
and correlation, 141 
formula for line of, 127 
linearity, assumption of, wo 
nonlinear, 138, 163 
origin of term, 137 
second line of, 145 
slope of line, 135 . 

Rejection of hypothesis, m 
region of, 231 

Reliability coefficient, 1M 

computation from split halves, 155 

effect of range on, 155 
effect on validity, 150 

Residual error, 128 2 44 

Responses, caution in sampling, ^ ^ 
Restriction of range in test vali y. 

correction for, 169 , . 209 

Risk of error in test of hypothesis, 

Robertson, H. V., 203 
Ruger, H. A., 139 


Sample, 202, 207 

generalization from, 2U* 

simple random, 207 

size of, see Number of cases 

Sampling distribution 1 «« Distribution 

SampfSgstSiility of average, 108 
Scaled variable, 10, 50 
as continuous variable, 55 
Scatter diagram, 124 


Score limits, 56 
Semilogarithmic graph, 45 
Sequential test, 236 

S 'Ts te ^mple of nonparametric test, 

Significance, level of, see Level of sigmfi- 

Size of class interval, 55, 148 
Size of sample, see Number of cases 

Skewness 

of distribution, 113 
left and right, 113 
measure of, 114 
negative and positive, 113 
Slope of line of regression, 135 
Spanner head, 29 

IPZn-B™-Prophecy formula ,55 

Soman's rank order coefficient, 159 
Spread of distribution, 73 
Standard deviation, 90 
computation, 91-97 

in combined groups, 101 
dichotomous population, 246 
effect of modifying scores, 117 
normal distribution, 179 
sampling distribution, 210 
Standard error, 213 

correlation coefficient, 258 
difference between 

means, 233 

proportions, 255 
Zt values, 261 

of estimate or prediction, 1 SI 
mean, 213 

proportion, 249 

Standard score, 102 
for mean, 217 
Stanine scale, 192 
Statistic, 208 oftg 

relation to parameter 208 
Statistical inference, 202, 208 

Statistical inquiry . - 

as boois (or choice of actton, 5 

issues preliminary to, / 

purposes of, 7 

S Jp a ^rvat« a- interval 

!ShX’e B re'.aUon,hip, 120 

Stratdied'randoift sample, 207 

Stub of a table, 22 
Stubhead, 30 

Student’s distribution, 221 
Table V, 271 
Subhead, 30 

partition of, 133 

Symbols 

correlation, 

Glossary of, 284 
mean, 881 208 

*■ percentile) 71 ojr 

pro|>ortion, 245, 246 
quartile, 72 

etawlard deviation, 90, 208 
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Symbols (coni.) 
summation, 87 
variance, 89, 208 
Symmetric distribution, 113 

T score, 196 
i distribution, 221 
Table V, 271 
Tables, basic data 
expectation of life, 32 
motor abilities of 37 pairs, 252 
old-age insurance, 42 
population of United States, 51 
scores in course on statistical methods, 274 
scores of 109 pupils, 276 
Tables, computing procedures 
correlation coefficient, 148 
mean, 95 
median, 70 
percentile rank, 85 
regression, 130, 148 
standard deviation, 95 
test of hypothesis, 232 
Tables, construction 
computing entries by 
sorting, 26 
tallying, 22 

element of a table, 21, 28-30 
Tables to facilitate computation 
r transformed to z r , 272 
random numbers, 280 
ranks transformed to standard scores, 279 
squares, square roots, and reciprocals, 
268, 269 

Tables of probability 
normal, 265-267 
Student’s or t, 271 
Tables, significance values 
correlation coefficient, 272 
proportion, 282 
Tail of distribution, 231 
Tallying, 22 
Terman, L., 196 
Test of hypothesis, 228 
fixed-sample-size, 236 
level of significance, 230, 236, 240 
multiple-decision, 236 
one-sided or one-tailed, 231, 237, 241 
power, 236 

com pu tat ion of, 241 
power function, 238 
region of rejection, 231 
sample size, 237, 240 
sequential, 236 

two-sided or two-tailed, 231, 237, 241, 
see also Hypothesis 
Third trait, effect on correlation, 170 
Thorndike, E. L., 190 
Thorndike, It. L., 166 
Time series, 38 
Title 

of graph, 35 
of table, 21, 29 


Transformation of 

ordered categories to scaled scores, 193 
r to Zr, 259, 273 

ranks to standard scores, 199, 279 
Two-class population, 203 
Two-decision tests, 236 
Two-sided (two-tailed) tests, 231 
Type I and Type II error, 235 

Unbiased estimate, 215 
Unit normal distribution, 184 
Unit of statistical investigation, 8 
Univariate population, 257 
Universe, 203, see also Population 
Upper quartile, 72 
Upper-tailed test, 237 

Validity, 155 
affected by 
reliability, 156 
restriction of range, 168 
in personnel selection, 156 
Variability, 73 
measured by 

interpercentile range, 75 
interquartile range, 75 
range, 74 

standard deviation, 90 
standard error, 213 
variance, 89 

in sampling distribution, 209 
Variable, 9 
continuous, 10 
dependent, 122 
dichotomous, 9 
discrete, 10, 53 
independent, 122 
ordered, 10 
scaled, 10, 50 
Variance, 89 

computation from 
arbitrary origin, 93 
combined groups, 101 
gross scores, 90 
grouped scores, 95 
as parameter, 208 
as square of standard deviation, 90 
Variation, coefficient of, 99 

Walker, II. M., 70, 80, 137, 166, 189, 200, 
209, 227 

Wallis, W. A., 203 
Wert, J. E., 265 

Yates, F., 255, 271, 272, 273 
Yates’ correction, 255 

Z score, 196 

Zero ol der correlation, 171 
Zero point in graph, 61 
z r transformation of r, 259, 273 





